fal.ai

Evaluation for teams building on fal.ai

Frametail wraps the fal client for automatic tracing and gives you benchmarks and scorers so every fal generation can be compared on pinned datasets.

Why fal teams add Frametail

fal.ai handles inference — fast queues, model endpoints, and storage for generated clips. Frametail sits on the other side of that call: spans with latency and errors, datasets you pin once, and benchmarks that stay immutable week over week.

If you are shipping a generative video feature on fal, you already feel the gap: playground clips do not become release artifacts, and “it looks better” does not survive a stakeholder review. Frametail closes that loop without replacing fal.

What gets traced

The SDK wraps `@fal-ai/client` so each `run` or `subscribe` call exports spans with inputs, outputs, latency, and errors. Spans land beside video artifacts in the trace viewer — not as a wall of syslog detached from the clip.

From there you can sample production traffic with live scoring rules, or run a full benchmark when you are ready to sign off.

Typical fal workflow

Wire tracing in development, agree on a scorer roster with your PM, pin a dataset of representative prompts, and run a benchmark before you change model endpoints in production. When regressions show up, the trace explains which span failed — not just that “quality dropped.”

Workflow

  1. Install `frametail` and `@fal-ai/client`, call `initializeFalClient` with tracing enabled.
  2. Generate video through fal as usual — spans export automatically.
  3. Review traces beside artifacts; iterate on prompts and parameters in experiments.
  4. Pin a dataset and run an immutable benchmark with your scorer contract.
  5. Share the benchmark link in release review or PRs.

Read the technical setup guide →