Gemma 2 9B Model Implemented

We are pleased to announce the implementation of the latest model: google/gemma-2-9b-it.

About Gemma 2 9B model:

Gemma 2 9B Instruction tuned model is a Cutting-edge text generation LLM for text understanding, transformation, and code generation.

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. They are text-to-text, decoder-only large language models, available in English, with open weights for both pre-trained variants and instruction-tuned variants. Gemma models are well-suited for a variety of text generation tasks, including question answering, summarization, and reasoning.

With MonsterAPI's Gemma 2 9B IT model API, you can instantly integrate it in your application and apply it for various use-cases as mentioned above.

To get started, you may follow this code:

from openai import OpenAI

client = OpenAI(
  base_url = "https://llm.monsterapi.ai/v1/",
  api_key = "YOUR_MONSTERAPI_KEY"
)

completion = client.chat.completions.create(
  model="google/gemma-2-9b-it",
  messages=[{"role":"user","content":"Explain special theory of relativity as if I am 5 year old."}],
  temperature=0.2,
  top_p=0.7,
  max_tokens=1024,
  stream=True
)

for chunk in completion:
  if chunk.choices[0].delta.content is not None:
    print(chunk.choices[0].delta.content, end="")