V4 Pro launched April 24, 2026. Been running it on production agents since. Specs Total params: 1.6T (MoE) Active params: 49B Context: 1M tokens (verified) Modes: Think / Non-Think dual License: MIT Pricing: $1.74/1M input, $3.48/1M output API Setup (OpenAI-compatible) client = OpenAI( base_url="https://integrate.api.nvidia.com/v1", api_key="" ) response = client.chat.completions.crea