added

Multiport Support for Custom Image Deployments Now Available

7 months ago by Vineet Jaiswal

We are excited to introduce a powerful new feature for our platform—multiport support for custom image deployments!

improved

Support added for Qwen model finetuning

7 months ago by Vineet Jaiswal

The Qwen team has recently released the latest model series, Qwen 2.5, marking a significant advancement in their offerings. The model is designed to enhance performance, broaden application capabilities, and provide users with a more versatile AI experience.

added

Llama 3.3 70B model deployment is available now

7 months ago by Vineet Jaiswal

LLama 3.3 is Meta's latest and most powerful 70-billion-parameter language model, offering state-of-the-art performance across a wide range of natural language processing tasks. It represents a significant step forward in terms of scalability, efficiency, and fine-tuning flexibility.

added

VLM model finetuning and deployment

7 months ago by Vineet Jaiswal

VLM finetuning allows you to adapt pre-trained multimodal models to better handle specific types of images and tasks relevant to your use case. Here are some key aspects:

deprecated

Phi 3 and Mistral 7B Serverless API deprecation notice

7 months ago by Vineet Jaiswal

microsoft/Phi-3-mini-4k-instruct and mistralai/Mistral-7B-Instruct-v0.2 serverless APIs will be deprecated on 18th January 2025. That means Phi3 mini and Mistral 7B LLM Serverless APIs will no longer process any requests from 18th January 2025 onwards.

improved

Llama 3 API Upgrade to Llama 3.1

11 months ago by Rishabh Pandey

We’re thrilled to announce a significant upgrade to our API! The Llama 3 8B model has now been upgraded to the latest Llama 3.1 8B version.

added

New Instruction Synthesizer API for generating structured datasets

about 1 year ago by Rishabh Pandey

Unlock the Future of Language Models with Our Revolutionary Instruction Synthesizer API! 🚀

Boosting LLM Performance with Unsloth and SDPA Integration 🚀

about 1 year ago by Rishabh Pandey

We're thrilled to unveil two major upgrades to MonsterTuner, designed to supercharge your LLM fine-tuning: Unsloth and Scaled Dot-Product Attention (SDPA). These innovations bring significant enhancements in performance, efficiency, and context length.

deprecated

TinyLlama LLM API Deprecation Notice

about 1 year ago by Rishabh Pandey

TinyLlama/TinyLlama-1.1B-Chat-v1.0 deprecated from July 3rd, 2024

added

Gemma 2 9B Model Implemented!

about 1 year ago by Rishabh Pandey

We are pleased to announce the implementation of the latest model: google/gemma-2-9b-it.