Brief-details: AltCLIP is a bilingual CLIP model supporting Chinese and English, trained on the WuDao and LAION datasets. It delivers strong text-image retrieval performance and serves as the text encoder behind the bilingual AltDiffusion (Stable Diffusion) model.
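A minimal bilingual retrieval sketch, assuming the transformers AltCLIP classes and the "BAAI/AltCLIP" hub id; the image URL and captions are illustrative only:

```python
import requests
from PIL import Image
from transformers import AltCLIPModel, AltCLIPProcessor

model = AltCLIPModel.from_pretrained("BAAI/AltCLIP")
processor = AltCLIPProcessor.from_pretrained("BAAI/AltCLIP")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
# Score the same image against one English and one Chinese caption.
inputs = processor(text=["a photo of two cats", "两只猫的照片"],
                   images=image, return_tensors="pt", padding=True)
probs = model(**inputs).logits_per_image.softmax(dim=1)
print(probs)  # per-caption match probabilities
```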
Brief-details: An 8B-parameter Llama-3 model optimized with FP8 quantization, recovering 99.28% of the original model's accuracy while halving memory requirements.
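A hypothetical loading sketch with vLLM, which reads FP8 checkpoints natively; the repo id below is a placeholder, not taken from the entry above:

```python
from vllm import LLM, SamplingParams

# Placeholder repo id; substitute the actual FP8 checkpoint.
llm = LLM(model="neuralmagic/Meta-Llama-3-8B-Instruct-FP8")
params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Explain FP8 quantization in one sentence."], params)
print(outputs[0].outputs[0].text)
```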
Brief-details: A powerful 2B-parameter text-to-image model using the Next-DiT architecture with a Gemma-2B text encoder, optimized through supervised fine-tuning for high-quality image generation.
Brief-details: A powerful ControlNet model for SDXL that generates Midjourney-quality images conditioned on edge-detection maps, trained on 10M+ high-quality images.
Brief-details: A specialized ControlNet model for human pose detection and image generation, trained on OpenPose data with improved hand/face detection capabilities.
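A pose-conditioned generation sketch with diffusers; the checkpoint ids and the precomputed pose map are assumptions, not confirmed by the entry above:

```python
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/control_v11p_sd15_openpose",  # assumed repo id
    torch_dtype=torch.float16)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet,
    torch_dtype=torch.float16).to("cuda")

pose = Image.open("pose_map.png")  # precomputed OpenPose skeleton image
image = pipe("a dancer on stage", image=pose, num_inference_steps=30).images[0]
image.save("dancer.png")
```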
Brief-details: Helsinki-NLP's Spanish-to-Galician translation model, achieving a 67.6 BLEU score. Uses a transformer-align architecture with SentencePiece tokenization.
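Running it through the transformers translation pipeline is a one-liner; "Helsinki-NLP/opus-mt-es-gl" is the assumed hub id for this checkpoint:

```python
from transformers import pipeline

translator = pipeline("translation", model="Helsinki-NLP/opus-mt-es-gl")
print(translator("El tiempo es muy bueno hoy.")[0]["translation_text"])
```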
Brief-details: EVA-Qwen2.5-32B is a large language model with 32.8B parameters, offered in multiple GGUF quantized versions that trade off size against quality for efficient deployment.
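A sketch of serving one of the GGUF quants with llama-cpp-python; the file name is a placeholder for whichever quantization level you download:

```python
from llama_cpp import Llama

llm = Llama(model_path="EVA-Qwen2.5-32B-Q4_K_M.gguf",  # placeholder file name
            n_ctx=4096, n_gpu_layers=-1)  # -1 offloads all layers to GPU
out = llm("Summarize the trade-off between Q4 and Q8 quantization.",
          max_tokens=128)
print(out["choices"][0]["text"])
```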
Brief-details: BLIP vision-language model trained on COCO dataset for image-text matching, supporting both understanding and generation tasks with state-of-the-art performance.
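A minimal image-text matching sketch; "Salesforce/blip-itm-base-coco" is the assumed hub id given the COCO training noted above:

```python
import requests
from PIL import Image
from transformers import BlipForImageTextRetrieval, BlipProcessor

repo = "Salesforce/blip-itm-base-coco"  # assumed hub id
processor = BlipProcessor.from_pretrained(repo)
model = BlipForImageTextRetrieval.from_pretrained(repo)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, text="two cats sleeping on a couch",
                   return_tensors="pt")
itm = model(**inputs).itm_score.softmax(dim=1)  # [no-match, match]
print(itm)
```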
Brief-details: Optimized SDXL-based text-to-image model focused on speed, with recommended settings for fast inference (5-7 steps) while preserving high-quality output.
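A hypothetical few-step inference sketch with diffusers, matching the 5-7 step recommendation; the repo id is a placeholder:

```python
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "some-org/fast-sdxl",  # placeholder: use the actual checkpoint id
    torch_dtype=torch.float16).to("cuda")

# Few-step checkpoints typically pair low step counts with low guidance.
image = pipe("a lighthouse at dusk, photorealistic",
             num_inference_steps=6, guidance_scale=2.0).images[0]
image.save("lighthouse.png")
```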
Brief-details: A specialized LoRA model for minimalist logo design, built on FLUX.1-dev. Features unique trigger words and dual combination capabilities for creating professional logos.
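A sketch of attaching such a LoRA to FLUX.1-dev in diffusers; the LoRA repo id and the trigger word are placeholders, so check the model card for the real ones:

```python
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16).to("cuda")
pipe.load_lora_weights("some-org/minimalist-logo-lora")  # placeholder repo id

# The trigger word (placeholder here) activates the LoRA's style.
image = pipe("logomkrdsgn, minimalist coffee shop logo, flat vector",
             num_inference_steps=28, guidance_scale=3.5).images[0]
image.save("logo.png")
```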
Brief-details: SpaceLLaVA-lite is an enhanced spatial reasoning model built on MobileVLM, specialized in understanding object relationships in visual scenes through VQASynth techniques.
Brief-details: ALIGN-base is a dual-encoder vision-language model pairing EfficientNet and BERT encoders, trained on the COYO-700M dataset for zero-shot image classification and multi-modal embeddings.
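A zero-shot classification sketch, assuming the transformers ALIGN classes and the "kakaobrain/align-base" hub id:

```python
import requests
from PIL import Image
from transformers import AlignModel, AlignProcessor

processor = AlignProcessor.from_pretrained("kakaobrain/align-base")
model = AlignModel.from_pretrained("kakaobrain/align-base")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
labels = ["a photo of a cat", "a photo of a dog"]
inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
probs = model(**inputs).logits_per_image.softmax(dim=1)
print(dict(zip(labels, probs[0].tolist())))
```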
Brief-details: A powerful 7B-parameter Mistral-based model with enhanced function-calling capabilities, achieving 90% accuracy on function calls and 84% on JSON outputs. Built for instruction following and structured outputs.
Brief-details: Text-to-image model specializing in versatile artistic styles, particularly strong at detailed backgrounds and anime-style artwork; 16K+ downloads.
Brief-details: A Helsinki-NLP translation model for Catalan-to-Italian conversion with a strong BLEU score of 48.6 and a chrF2 score of 0.69, built on the transformer-align architecture.
Brief-details: A versatile text-to-image model combining RadiantVibes and Paramount with the Dreamlike_Diversions LoRA, specializing in photorealistic and fantasy imagery.
Brief-details: Enformer - a Transformer-based model for gene expression prediction from DNA sequences, developed by DeepMind and ported to PyTorch. CC-BY-4.0 licensed.
Brief-details: DeepSeek-V2-Lite-Chat is a 15.7B-parameter MoE model with 2.4B active parameters, featuring Multi-head Latent Attention and efficient inference, deployable on a single 40GB GPU.
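A loading sketch with transformers; the hub id is assumed, and trust_remote_code is needed because the architecture ships custom modeling code:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "deepseek-ai/DeepSeek-V2-Lite-Chat"  # assumed hub id
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo, torch_dtype=torch.bfloat16, trust_remote_code=True,
    device_map="auto")

messages = [{"role": "user",
             "content": "Explain Multi-head Latent Attention briefly."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[1]:], skip_special_tokens=True))
```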
Brief-details: LLaVA-v1.6-34b is a large-scale multimodal model with 34.8B parameters, handling image-text tasks and built on the Nous-Hermes-2-Yi-34B base.
Brief-details: A specialized LoRA for FLUX.1-dev that creates realistic Polaroid-style photos, released under a CC BY-NC 4.0 license. Perfect for vintage-inspired imagery.
Brief-details: A sophisticated 70B-parameter LLM optimized for roleplay and storytelling, with strong performance across multiple benchmarks and a 67.11 average score on the Open LLM Leaderboard.