Heroku Managed Inference and Agents
Simplify AI in your applications through managed inference, agents, and the power of an AI PaaS
Get Started NowAI integration made easy with Managed Inference and Agents
Heroku Managed Inference and Agents provides a managed inference service, delivering powerful models and tools as a trusted, secure, and scalable service optimized for developers. Simplify your AI operations and focus on building your application, not managing infrastructure.
View Documentation Heroku Managed Inference and Agents Plans
Why Heroku Managed Inference and Agents?
Leading AI models, managed for you
Access an opinionated set of models from the world’s top AI providers, chosen for their generative power and performance. Whether you’re exploring AI for the first time or building enterprise-grade solutions, Heroku Managed Inference and Agents provides the right models optimized for ease of use and efficacy in the domains you need most.
Performance and trust
Benefit from Heroku’s world-class infrastructure that delivers high performance and secure AI services. We handle the complexity and overhead of operating AI infrastructure and systems, so you can focus on your core business needs with confidence.
Graceful developer experience
Adding AI to your application is as simple as attaching the Heroku Managed Inference and Agents add-on to your app or using heroku ai:models:create
in the CLI. Test and evaluate models directly from the command line with heroku ai:models:call
, making it easy to optimize prompts and debug interactions without complex setups.
Seamless application integration
Heroku Managed Inference and Agents provides environment variables for selected models, making it seamless to call from within your application. Config vars make it easy to securely connect to your Heroku Managed Inference and Agents endpoints, so you can focus on building features rather than managing infrastructure.
Powerful agentic capabilities
Create AI applications that can take action and interact with your systems. Using MCP with Heroku Managed Inference and Agents enables agents to access external tools, manipulate data, and execute functions within your application. Design AI assistants that can analyze spreadsheets, query databases, call APIs, and generate reports – all while respecting your security boundaries and business logic.
Focus on value, not complexity
Just as Heroku simplified application deployment and scaling, Heroku Managed Inference and Agents makes AI accessible so you can focus on the value of your AI-enhanced applications rather than taking on the complexity of operating rapidly evolving AI technology.

Get started now with Heroku Managed Inference and Agents
- Create a Heroku app
- Attach Heroku Managed Inference and Agents add-on to your app
- Access top AI models
- Deploy your code to your app
- Build AI-powered experiences
Build intelligent apps
Intelligence at the core
Embed AI capabilities directly into your application’s core logic. Heroku Managed Inference and Agents enables you to integrate AI models for text generation, reasoning, and content creation with minimal overhead. Build apps that adapt to user needs, generate dynamic content, and deliver personalized experiences with high performance and predictable pricing.
Multi-model, multi-purpose
Access a curated selection of leading AI models tailored for different needs and budgets. Choose language models optimized for speed, quality, or cost. Heroku Managed Inference and Agents’ unified access pattern makes it easy to switch between models or use different models for different parts of your application, all while maintaining a consistent developer experience.
Agentic applications
Create AI applications that can take action and interact with your systems. Using MCP with Heroku Managed Inference and Agents enables agents to access external tools, manipulate data, and execute functions within your application. Design AI assistants that can analyze spreadsheets, query databases, call APIs, and generate reports – all while respecting your security boundaries and business logic.
Heroku Managed Inference and Agents models
Claude Sonnet 3.7
Advanced language model for complex reasoning and creative tasks.- High accuracy
- Complex reasoning
- Creative generation
- Tool Calling (MCP)
Claude Sonnet 3.5
A state-of-the-art large language model that supports chat and tool-calling.
- General purpose
- Cost-effective
- Reliable performance
- Tool Calling (MCP)
Claude Haiku 3.5
A faster, more affordable large language model that supports chat and tool-calling.- Fast inference
- Low latency
- Resource efficient
- Tool Calling (MCP)
Claude Haiku 3
A faster, more affordable large language model that supports chat and tool-calling.- Basic tasks
- Cost-effective
- Quick deployment
- Tool Calling (MCP)
Cohere Embeddings
A state-of-the-art embedding model that supports multiple languages. This model is helpful for developing Retrieval Augmented Generation (RAG) search.- Semantic search
- Text similarity
- High-dimensional vectors
Stable Diffusion XL
A state-of-the-art diffusion (image generation) model.
- High-quality images
- Creative generation
- Customizable outputs