gemma-3-27b-it-int4-awq

Maintained By
gaunernst

Gemma 3 27B Instruction-Tuned INT4

Property           Value
Original Model     Google Gemma 3 27B
Quantization       INT4
Context Window     128K tokens
Training Tokens    14 trillion
License            Gemma Terms of Use

What is gemma-3-27b-it-int4-awq?

This model is an INT4-quantized version of Google's Gemma 3 27B instruction-tuned model, converted to HF+AWQ format for easier deployment. Quantization cuts the memory footprint substantially while preserving the capabilities of the original model, and the model still accepts both text and image inputs, making it a versatile option for multimodal applications.
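
As a rough illustration, the checkpoint can be loaded with the Hugging Face transformers multimodal pipeline. This is a minimal sketch, not taken from the model card: the repo id gaunernst/gemma-3-27b-it-int4-awq (inferred from the card title), the example image URL, and the assumption that a recent transformers release with Gemma 3 support plus the autoawq package are installed are all illustrative.

```python
# Minimal loading sketch (repo id, image URL, and installed packages are assumptions,
# not details from the model card; transformers >= 4.50 with Gemma 3 support assumed).
from transformers import pipeline

pipe = pipeline(
    "image-text-to-text",                       # Gemma 3 accepts text + image input
    model="gaunernst/gemma-3-27b-it-int4-awq",  # assumed Hub id based on the card title
    device_map="auto",                          # place the INT4 weights on available GPUs
)

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/invoice.png"},  # placeholder image
            {"type": "text", "text": "Summarize this document in two sentences."},
        ],
    }
]

result = pipe(text=messages, max_new_tokens=256)
print(result[0]["generated_text"][-1]["content"])  # assistant turn appended to the chat
```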

Implementation Details

The INT4 weights come from quantization-aware training (QAT) and were converted from the original Flax checkpoint to the HF+AWQ format. The model retains the architecture's 128K-token context window and supports over 140 languages, enabling efficient deployment in resource-constrained environments while maintaining strong performance across a wide range of tasks.

  • Multimodal capabilities with support for text and image inputs (896x896 resolution)
  • Output generation up to 8192 tokens
  • Optimized for deployment on consumer hardware (a minimal serving sketch follows this list)
  • Converted to HF+AWQ format for broader compatibility
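
Because the weights ship in AWQ format, they can be loaded by inference engines with AWQ kernels such as vLLM. The sketch below is an assumption on my part rather than an instruction from the card; the repo id, the reduced max_model_len (to keep the KV cache within a single consumer GPU), and the sampling settings are illustrative.

```python
# Offline-inference sketch with vLLM (repo id, context cap, and sampling settings
# are assumptions, not values from the model card).
from vllm import LLM, SamplingParams

llm = LLM(
    model="gaunernst/gemma-3-27b-it-int4-awq",  # assumed Hub id
    quantization="awq",                         # use vLLM's AWQ kernels for the INT4 weights
    max_model_len=32768,                        # cap the 128K window to fit the KV cache on one GPU
)

params = SamplingParams(temperature=0.7, max_tokens=512)
outputs = llm.chat(
    [{"role": "user", "content": "Explain what INT4 quantization changes in two sentences."}],
    params,
)
print(outputs[0].outputs[0].text)
```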

Core Capabilities

  • Strong performance in reasoning and factuality tasks (85.6% on HellaSwag)
  • Robust STEM and code generation capabilities (82.6% on GSM8K)
  • Multilingual support with strong performance (74.3% on MGSM)
  • Advanced vision-language understanding (85.6% on DocVQA)

Frequently Asked Questions

Q: What makes this model unique?

The model combines the powerful capabilities of Gemma 3 with efficient INT4 quantization, making it possible to run a state-of-the-art multimodal model on consumer hardware while maintaining high performance across various benchmarks.
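
A quick back-of-the-envelope estimate (not an official figure from the card) shows why quantization matters for consumer hardware: raw weight storage drops roughly fourfold compared with a 16-bit checkpoint.

```python
# Rough weight-memory estimate: 27B parameters at 16-bit vs. 4-bit precision.
# Activations, KV cache, and AWQ scale factors add overhead on top of these numbers.
params = 27e9

bf16_gb = params * 2 / 1024**3    # ~50 GB of raw weights at BF16
int4_gb = params * 0.5 / 1024**3  # ~13 GB of raw weights at INT4

print(f"BF16 weights ~{bf16_gb:.0f} GB, INT4 weights ~{int4_gb:.0f} GB")
```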

Q: What are the recommended use cases?

The model excels at content creation, chatbots, text summarization, image analysis, research applications, and educational tools. It is particularly well suited to deployments where resource efficiency is crucial but output quality cannot be compromised.
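
If the model is served behind an OpenAI-compatible endpoint (for example with vllm serve), a chatbot or summarization client could look like the following sketch; the base URL, port, and model name are assumptions rather than details from the card.

```python
# Hypothetical client-side sketch against an OpenAI-compatible server
# (e.g. started with `vllm serve gaunernst/gemma-3-27b-it-int4-awq`).
# Base URL, API key, and model name are illustrative.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="gaunernst/gemma-3-27b-it-int4-awq",
    messages=[
        {"role": "user", "content": "Summarize the trade-offs of INT4 quantization for a 27B model."},
    ],
    max_tokens=256,
)
print(response.choices[0].message.content)
```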

🍰 Interested in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.