Installation and Setup

To install the integration, run:
pip install llama-index-llms-heroku
To set up LlamaIndex:
  • Create a Heroku app and provision an inference model:
heroku create example-app
heroku ai:models:create -a example-app claude-4-5-haiku
  • Export configuration variables:
export INFERENCE_KEY=$(heroku config:get INFERENCE_KEY -a example-app)
export INFERENCE_MODEL_ID=$(heroku config:get INFERENCE_MODEL_ID -a example-app)
export INFERENCE_URL=$(heroku config:get INFERENCE_URL -a example-app)
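Before moving on, it can help to confirm that all three variables actually reached your Python environment. This small stdlib check (the helper name `missing_inference_config` is ours, not part of the integration) reports any that are unset:

```python
import os

def missing_inference_config(env=None):
    """Return the names of any required Heroku inference variables that are unset."""
    required = ("INFERENCE_KEY", "INFERENCE_MODEL_ID", "INFERENCE_URL")
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]
```

If the returned list is non-empty, re-run the export commands above in the same shell that launches your Python process.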

Using the Integration

Available Models

For a complete list of available models, see Managed Inference and Agents API Model Cards.

Chat Completion Example

from llama_index.llms.heroku import Heroku
from llama_index.core.llms import ChatMessage, MessageRole

# Initialize the Heroku LLM
llm = Heroku()

# Create chat messages
messages = [
    ChatMessage(
        role=MessageRole.SYSTEM, content="You are a helpful assistant."
    ),
    ChatMessage(
        role=MessageRole.USER,
        content="What are the most popular house pets in North America?",
    ),
]

# Get response
response = llm.chat(messages)
print(response)

Using Environment Variables

If no parameters are passed explicitly, the integration reads its configuration from environment variables:
import os

# Set environment variables
os.environ["INFERENCE_KEY"] = "your-inference-key"
os.environ["INFERENCE_URL"] = "https://us.inference.heroku.com"
os.environ["INFERENCE_MODEL_ID"] = "claude-4-5-haiku"

# Initialize without parameters
llm = Heroku()

Parameters

You can pass parameters directly:
import os

llm = Heroku(
    model=os.getenv("INFERENCE_MODEL_ID", "claude-4-5-haiku"),
    api_key=os.getenv("INFERENCE_KEY", "your-inference-key"),
    inference_url=os.getenv(
        "INFERENCE_URL", "https://us.inference.heroku.com"
    ),
    max_tokens=1024,
)

Text Completion Example

response = llm.complete("Explain the importance of open source LLMs")
print(response.text)

Error Handling

The integration includes error handling for common configuration issues:
  • Missing API key
  • Invalid inference URL
  • Missing model configuration
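The exact exception types are not documented here, so as a sketch you can also validate the configuration up front, before constructing the client. `validate_config` is a hypothetical helper that raises ValueError for each case in the list above:

```python
from urllib.parse import urlparse

def validate_config(api_key, inference_url, model):
    """Raise ValueError for the common misconfigurations listed above (hypothetical helper)."""
    if not api_key:
        raise ValueError("Missing API key: set INFERENCE_KEY or pass api_key")
    parsed = urlparse(inference_url or "")
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"Invalid inference URL: {inference_url!r}")
    if not model:
        raise ValueError("Missing model configuration: set INFERENCE_MODEL_ID or pass model")
```

Running this check at startup surfaces configuration problems immediately instead of at the first API call.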

Additional Resources