Doubleword logo black
Product
Products
Doubleword API
NEW
Inference built for scale
Doubleword Inference Stack
High performance inference stack
Use Cases
Async Agents
Long running background agents
Synthetic Data Generation
Generate high volumes of data for fine- tuning
Data Processing
Apply intelligence to large volumes of data
Resources
Documentation
Technical docs and API reference
Workbooks
Ready-to-run examples
Seen in the Wild
Community content and projects
Resource Centre
All our blogs and guides
Technical Blog
Our blog on building inference systems
Al Dictionary
Key Al terms explained
Savings Calculator
See how much you save with Doubleword
Solutions
By Deployment Option
On-premiseCloudHybrid
By Team
AI, ML & Data SciencePlatform, DevOps & ITCompliance & Cyber
Pricing
Docs
Pricing
Get started - Free
Get started - Free
Resources
/
Blog
/
15/4 Weekly Update: HumanX and the Gemma 4 release
April 15, 2026

15/4 Weekly Update: HumanX and the Gemma 4 release

Meryem Arik
Share:
https://doubleword.ai/resources/15-4-weekly-update-humanx-and-the-gemma-4-release
Copied
To Webinar
•

Huge week for Doubleword at HumanX - one clear takeaway: more teams are moving non-real-time workloads off expensive realtime APIs.

New: Doubleword CLI

We’ve also shipped the dw CLI - making it much easier to run and scale async workloads, especially for coding agents.

You can prepare data, run batches, stream results, and track cost - all directly from your terminal (check it out below!)

Docs: https://docs.doubleword.ai/inference-api/dw-cli

New model: Gemma 4 (31B)

We’ve also added google/gemma-4-31B-it to Doubleword. It’s Google DeepMind’s most capable open model, built for advanced reasoning, coding, and multimodal tasks.

It’s now live at $0.07 / $0.20 per 1M tokens (batch) which is ~90% cheaper than comparable models like Claude Haiku 4.5 and GPT-5-mini.

If you’ve previously run batches with Qwen3.5-35B, this sits in a similar intelligence class - worth swapping the model and running the same job side-by-side to compare performance on your workload.

Footnotes

Table of contents:

Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
"
Learn more about self-hosted AI Inference
Subscribe to our newsletter
Thanks you for subscription!
Oops! Something went wrong while submitting the form.

Stop overpaying for inference.

Teams use Doubleword to run low-cost, large-scale inference pipelines for async jobs.
‍
Free credits available to get started.

Get started - Free
Doubleword logo black
AI Inference, Built for Scale.
Products
Doubleword APIDoubleword Inference Stack
Use Cases
Async AgentsSynthetic Data GenerationData Processing
Resources
Seen in the WildDocumentationPricingAsync Pipeline BuilderResource CentreTechnical BlogAI Dictionary
Company
AboutPrivacy PolicyTerms of ServiceData Usage Policy
Careers
Hiring!
Contact
© 2026 Doubleword. All rights reserved.
We use cookies to ensure you get the best experience on our website.
Accept
Deny