Skip to main content

Developer APIs & Resources

TritonAI Developer API

Access enterprise-grade large language models (LLMs) through UC San Diego's secure and centralized API gateway.

Overview

The TritonAI Developer API gives UC San Diego faculty, staff, researchers, and campus teams programmatic access to curated large language models through a secure, centralized LLM gateway powered by LiteLLM. The gateway connects approved AI development tools and application stacks to commercial cloud providers and SDSC-hosted open-source models, while preserving campus standards for data protection, authentication, logging, and responsible use.

The program is designed for practical AI development: prototype with approved AI coding tools, connect to the gateway, build with reusable campus patterns, and then move completed artifacts into the right hosting lane for review, support, and long-term operation.

TritonAI API Program

Diagram of the TritonAI Developer API Program showing campus users using AI development tools that connect to the LLM Gateway, which accesses cloud and SDSC-hosted model providers for chat, reasoning, vision, image generation, OCR, and coding.
TritonAI provides a managed gateway between campus users, approved AI development tools, model providers, and governed delivery patterns.

How the Program Works

The Developer API is more than model access. It is a campus development path for AI-enabled applications, from early experimentation through production hosting.

  • Request access: Describe the use case, intended audience, campus data involved, and expected model needs.
  • Receive API credentials: Approved projects receive an API key, starter credits, templates, and guidance for using approved tools and integration patterns.
  • Build with campus patterns: Use approved stacks, campus authentication, curated integrations, and documented agent or application templates.
  • Host appropriately: Move built artifacts into the right hosting lane, from individual sandboxes to department-managed applications or enterprise delivery.

ITS owns the shared gateway, templates, baseline risk review, usage tracking, hosting infrastructure, SSO, logs, and the right to remove unsafe or unsupported applications. Departments and project teams own their application logic, dependencies, accessibility and testing, end-user support, and a named technical point of contact.


Model Hosting & Data Protection

The TritonAI Developer API provides access to models through multiple hosting environments, giving teams a way to choose the right model path based on data sensitivity, capability needs, and operational requirements.

Cloud Provider Models

We offer access to leading commercial models from major cloud providers, including Microsoft Azure, Google Cloud Platform, and Amazon Web Services, operating under UC San Diego's campus-wide enterprise agreements.

These agreements provide enterprise data protection terms that meet UC system requirements, ensure prompts and responses are not used for model training, and include contractual safeguards that individual accounts cannot obtain.

SDSC Self-Hosted Models

For projects requiring the highest level of data control, TritonAI also provides access to locally hosted open-source and open-weight models running within UC San Diego infrastructure at the San Diego Supercomputer Center.

With self-hosted models, inference processing occurs on campus infrastructure. This makes them appropriate for research involving sensitive data, projects with heightened privacy considerations, or use cases where on-premises processing is preferred or required.


Built Artifacts Need a Hosting Lane

API access can support experimentation, but durable campus tools need an operating model. As projects move from individual development to shared use, they may require scope review, recurring risk review, authentication, logging, support ownership, and migration into a more formal hosting environment.

Tiered hosting lane diagram for TritonAI-built artifacts, showing individual users, scattered users, many users, and campus-wide usage with increasing governance and hosting requirements.
As usage grows, projects move from citizen-initiated development toward reviewed, supported, and governed hosting lanes.

Small prototypes may remain on a user desktop, laptop, or sandbox. Tools used by a department or repeated audience should move into a reviewed campus application lane. Broad or mission-critical use cases require enterprise architecture, recurring review, support planning, and a named technical owner.


Available Models & Pricing

We offer a curated selection of large language models to meet diverse needs and budgets. The model catalog is updated as the AI landscape evolves and includes current availability, pricing, capabilities, and hosting environment.

View Complete Model List & Pricing

The model catalog includes options across multiple providers and capabilities:

  • Chat and reasoning models from Anthropic, OpenAI, Google, Mistral, and others
  • Vision-enabled models for image understanding and analysis
  • Image generation models for creating visual content
  • Specialized models for coding, OCR, and advanced reasoning tasks
  • Self-hosted models for maximum data control

Pricing is based on token usage. The model hub displays current per-token pricing, context window sizes, available features, and hosting environment.


Technical Details

API Compatibility

The LiteLLM gateway provides an OpenAI-compatible API interface, making it easier to integrate with existing tools, libraries, and codebases. If your application already works with the OpenAI API, it can typically work with TritonAI with minimal modifications.

Authentication

API access is secured through API keys issued upon approval. Keys should be stored securely and should never be committed to version control or shared publicly.

Rate Limits

Default rate limits are designed to support typical development and production workloads. Projects requiring higher throughput can request limit increases.

Documentation & Support

Approved users receive access to API documentation, code examples, integration guidance, and support resources for common development patterns.


Billing & Recharge Rates

Free Credits

All approved users receive $15 per month in free API credits for use with self-hosted models running on UC San Diego infrastructure. Free credits refresh monthly and are non-transferable.

Recharge Rates

Usage beyond monthly free credits for self-hosted models, as well as cloud provider model usage through Azure, Google Cloud, and AWS, is billed at pass-through cost to the designated chart string. Recharge rates reflect the actual cost of model access plus the infrastructure fee needed to sustain the service.

Project & Task Tracking

When requesting access, you will provide project and task information that maps to your chart string. This supports cost allocation, reporting, and budgeting for grant-funded or department-funded projects.

Usage Alerts

Usage alerts can notify teams when spending reaches defined thresholds, helping projects stay within budget and avoid surprises.

Ready for next steps?

View the Complete Model List & Pricing or read through Frequently Asked Questions.

Get Started

Stay Informed

To receive the latest announcements and news, subscribe to the TritonAI mailing list.

Subscribe