GPU Extensions

Bring your own API keys. Pauhu® routes to the best provider. You pay the provider directly.

How It Works

GPU Extensions are a multi-provider API gateway that adds no markup to provider compute costs. You provide your own API keys for external providers (OpenAI, Anthropic, Google, Replicate, etc.), and Pauhu handles routing, rate limiting, usage tracking, and a unified API surface. You pay the provider directly for compute; Pauhu charges only for the integration layer, through your subscription tier.

Your Application
      ↓
Pauhu GPU Gateway (Cloudflare Worker, EU jurisdiction)
  • JWT authentication
  • Tier-based rate limiting
  • API key format validation
  • Usage tracking (D1)
      ↓
External Provider (customer-paid)
  • OpenAI, Anthropic, Google, Replicate, etc.
  • You provide the API key
  • You pay the provider directly
      ↓
Response → Your Application

Policy Boundaries

Extensions operate under strict policy boundaries:

6 Extension Types

1. Large LLMs (70B+ Parameters)

Endpoint: /gpu/large-llms/chat

Chat completions with large language models that exceed browser-native capacity. Providers: OpenAI, Anthropic, Google Gemini, Together AI, Groq, Replicate.

POST /gpu/large-llms/chat
{
  "model": "gemini-1.5-pro",
  "messages": [{"role": "user", "content": "Translate to Finnish"}],
  "api_key": "YOUR_GOOGLE_API_KEY",
  "provider": "google"
}
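The request above could be assembled from Python roughly as follows. This is a minimal sketch, not an official client: the base URL is a placeholder (the gateway host is not stated here), and sending the JWT as a bearer token in the Authorization header is an assumption. The function only builds the request and does not send it.

```python
import json
import urllib.request

# Hypothetical placeholder; the real gateway host is not given in this document.
BASE_URL = "https://gateway.example.invalid"


def build_chat_request(jwt: str, provider_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /gpu/large-llms/chat."""
    body = {
        "model": "gemini-1.5-pro",
        "messages": [{"role": "user", "content": prompt}],
        "api_key": provider_key,  # your own provider key, as in the sample above
        "provider": "google",
    }
    return urllib.request.Request(
        url=BASE_URL + "/gpu/large-llms/chat",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the gateway JWT travels as a bearer token.
            "Authorization": f"Bearer {jwt}",
        },
        method="POST",
    )


req = build_chat_request("YOUR_PAUHU_JWT", "YOUR_GOOGLE_API_KEY", "Translate to Finnish")
print(req.get_method(), req.full_url)
```

Passing the returned object to urllib.request.urlopen would send it; that step is left out because the response format is not described here.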

2. Video Generation

Endpoint: /gpu/video-generation/generate-video

Text-to-video and image-to-video generation. Providers: OpenAI (Sora), Replicate, RunwayML, Pika, Fal.ai. Cost: $0.002–$0.20 per second (customer-paid).

3. Image Generation

Endpoint: /gpu/image-generation/generate-image

Text-to-image generation. Providers: OpenAI (DALL-E 3), Replicate, Fal.ai, Together AI. Cost: $0.001–$0.04 per image (customer-paid).

4. Real-time Video

Endpoint: /gpu/realtime-video/process-frame

Real-time object detection and video analysis. Providers: Roboflow (YOLOv8), Ultralytics, AWS Rekognition, Replicate. Cost: $0.00001–$0.12 per frame or per minute, depending on the provider (customer-paid).

5. Audio Generation

Endpoint: /gpu/audio-generation/generate-music

Music generation and text-to-speech. Providers: Suno (music), ElevenLabs (speech), Replicate, Stability AI, Mubert. Cost: $0.02–$0.50 per generation (customer-paid).

6. 3D Generation

Endpoint: /gpu/3d-generation/generate-3d

Text-to-3D and image-to-3D model generation with textures, rigging, and LODs. Providers: Trellis (Microsoft), Meshy, Luma AI, Rodin, Stability AI, Replicate. Cost: $0.05–$2 per model (customer-paid).

POST /gpu/3d-generation/generate-3d
{
  "prompt": "A medieval fantasy knight with armor",
  "api_key": "YOUR_RODIN_API_KEY",
  "provider": "rodin",
  "output_format": "glb",
  "with_textures": true,
  "with_pbr": true
}
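Since all six extension types follow the same path pattern, a client might keep the endpoints in one lookup table. The sketch below uses only the paths listed in this section; the short keys, endpoint_for, and build_3d_payload are hypothetical helper names, and the payload fields simply mirror the 3D sample request above.

```python
# Endpoint paths as listed in this section, keyed by extension type.
ENDPOINTS = {
    "large-llms": "/gpu/large-llms/chat",
    "video-generation": "/gpu/video-generation/generate-video",
    "image-generation": "/gpu/image-generation/generate-image",
    "realtime-video": "/gpu/realtime-video/process-frame",
    "audio-generation": "/gpu/audio-generation/generate-music",
    "3d-generation": "/gpu/3d-generation/generate-3d",
}


def endpoint_for(extension: str) -> str:
    """Return the path for an extension type, or raise ValueError if unknown."""
    try:
        return ENDPOINTS[extension]
    except KeyError:
        raise ValueError(f"unknown extension type: {extension!r}") from None


def build_3d_payload(prompt: str, provider_key: str) -> dict:
    """Mirror the fields of the 3D sample request (Rodin, textured GLB with PBR)."""
    return {
        "prompt": prompt,
        "api_key": provider_key,
        "provider": "rodin",
        "output_format": "glb",
        "with_textures": True,
        "with_pbr": True,
    }


print(endpoint_for("3d-generation"))
```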

Pricing

Tier          Monthly Fee  Generations / Month
Free          $0           10
Starter       $49          100
Professional  $199         1,000
Enterprise    Custom       Unlimited

These prices cover the Pauhu integration layer only. You pay the external provider separately for compute (LLM tokens, GPU time, etc.) using your own API key.
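Because the bill has two parts, a rough monthly total is the Pauhu tier fee plus whatever the provider charges per generation. The sketch below works through that arithmetic with the tier figures from the table; the Tier class and estimate_monthly_total are hypothetical names, and it assumes the monthly quota is counted per generation regardless of extension type, which this page does not confirm. For example, the Starter tier with 100 images at the upper image price of $0.04 comes to $49 + 100 × $0.04 = $53.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Tier:
    monthly_fee_usd: Optional[float]  # None means custom pricing
    generations: Optional[int]        # None means unlimited


# Figures taken from the pricing table.
TIERS = {
    "free": Tier(0.0, 10),
    "starter": Tier(49.0, 100),
    "professional": Tier(199.0, 1000),
    "enterprise": Tier(None, None),
}


def estimate_monthly_total(tier_name: str, generations: int,
                           provider_cost_per_generation: float) -> float:
    """Tier fee (paid to Pauhu) plus provider compute (paid to the provider), in USD."""
    tier = TIERS[tier_name]
    if tier.monthly_fee_usd is None:
        raise ValueError("this tier has custom pricing; no fixed fee to add")
    if tier.generations is not None and generations > tier.generations:
        raise ValueError(f"{generations} generations exceeds the tier quota of {tier.generations}")
    return round(tier.monthly_fee_usd + generations * provider_cost_per_generation, 2)


print(estimate_monthly_total("starter", 100, 0.04))
```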

Authentication

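All GPU extension requests require two credentials: a Pauhu JWT, which authenticates you to the gateway, and your own API key for the external provider, passed in the request body as api_key.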


© 2026 Pauhu Ltd. All rights reserved.