# Inference

Tutorials for serving models and building inference applications as Flyte apps.

### [Voice customer-service agent](https://www.union.ai/docs/v2/union/tutorials/inference/voice-customer-service/page.md)

Serve an LLM with vLLM and a browser voice UI as two composed Flyte apps, with switchable text-to-speech and a live latency comparison.

## Subpages

- [Voice customer-service agent](https://www.union.ai/docs/v2/union/tutorials/inference/voice-customer-service/page.md)
  - How it fits together
  - Serving the model
  - The voice UI app
  - The agent and the proxy
  - Speech in, speech out
  - What makes this a good Flyte app
  - Deploy
  - Test without a microphone
  - Going further

---
**Source**: https://github.com/unionai/unionai-docs/blob/main/content/tutorials/inference/_index.md
**HTML**: https://www.union.ai/docs/v2/union/tutorials/inference/
