Skip to content
IBM

IBM Granite 4.0 H Micro

flagship
IBM · released 2025-01-01 · text
currently routing · 4.2k rpm
8.2M tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
open
License
/ ABOUT

IBM Granite 4.0 H Micro is a compact language model from IBM's Granite family, designed for efficient enterprise AI applications. The 'Micro' designation indicates it's the smallest variant, optimized for deployment in resource-constrained environments including edge devices and on-premise servers with limited GPU memory.

Despite its small size, Granite 4.0 H Micro delivers capable performance on business tasks like text classification, summarization, extraction, and basic reasoning. It was trained on carefully curated enterprise data with a focus on business-relevant domains including finance, legal, healthcare, and technology.

This model is ideal for organizations that need AI capabilities without heavy infrastructure investment, supporting IBM's strategy of bringing AI to where the data lives.

BENCHMARKS Artificial Analysis Index
Intelligence 11

Providers for IBM Granite 4.0 H Micro

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
bf16
0.00%