⚠️

Model No Longer Available

This model has been delisted from the API. This page is preserved for reference purposes only.

View Alternatives
G

gemini-1.5-flash-8b

1000k ContextTextPopular

Google's gemini-1.5-flash-8b - compact and efficient with budget-friendly pricing. Google's advanced AI research.

Pricing / 1M
USD
Input$0.6
Output$2.4
Context Window
1000ktokens
Avg Latency
450ms

Capability Radar

Compare with Claude 3

About gemini-1.5-flash-8b

"Cutting-edge multimodal capabilities"

gemini-1.5-flash-8b is Google's budget-friendly model featuring focused context window (1M). Cutting-edge multimodal capabilities, this model delivers industry-leading context windows.

Key Strengths

  • Cutting-edge multimodal capabilities
  • Best price-to-performance ratio
  • Optimized for low-latency responses
  • OpenAI SDK compatible
  • Production-ready reliability

🎯 Best Use Cases

  • Real-time chatbots and customer support
  • High-volume content moderation
  • Quick classification and tagging
  • Full codebase analysis and refactoring
  • Multi-document research and synthesis
  • Long-form content generation

Technical Specifications

Context Window
1M
Max Output
4,096 tokens
Knowledge Domain
Chat, Text
Provider
Google

Quick Start Guide

example.tsTypeScript
import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://api.okrouter.com/v1",
  apiKey: "<YOUR_OKROUTER_KEY>",
});

async function main() {
  const completion = await openai.chat.completions.create({
    model: "gemini-1.5-flash-8b",
    messages: [
      { role: "user", content: "Hello! What can you do?" }
    ],
  });

  console.log(completion.choices[0].message);
}

main();

Getting started with gemini-1.5-flash-8b is easy. Since OkRouter is fully compatible with the OpenAI SDK, you can use existing libraries to access this model immediately. gemini-1.5-flash-8b is fully compatible with the OpenAI Chat Completions API. distinctive features include Expanded context window and Text support.

Interactive Playground

Parameters Setup

0.7

Controls randomness: Lowering results in less random completions.

Return data chunks as they arrive

request.js
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.okrouter.com/v1",
  apiKey: process.env.OKROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "gemini-1.5-flash-8b",
  messages: [{ role: "user", content: "Hello world!" }],
  temperature: 0.7,
  stream: true,
});

Frequently Asked Questions

?How much does gemini-1.5-flash-8b cost?

gemini-1.5-flash-8b costs $0.60 per 1M input tokens and $2.40 per 1M output tokens via OkRouter. This is 20% cheaper than OpenRouter with the same great performance.

?Is gemini-1.5-flash-8b compatible with OpenAI SDK?

Yes! gemini-1.5-flash-8b works seamlessly with all OpenAI SDKs (Python, JavaScript, Go, etc.). Simply change your base_url to api.okrouter.com and use your OkRouter API key. No code changes needed.

?What can I do with gemini-1.5-flash-8b's large context window?

With 1M context, you can process entire codebases, analyze multiple documents simultaneously, generate long-form content, and build sophisticated RAG applications without chunking.