Lara’s New Hardware, Co-Designed With Lenovo for AI Translation

Built for time-critical workflows, the new infrastructure enables Lara to outperform generic LLMs in quality and speed, opening up new use cases in localization.

Rome – Jun 10, 2025

Translated, a leader in AI-powered language solutions, today announced a major milestone for its translation AI, Lara, made possible through close collaboration with Lenovo, a global leader in high-performance computing. Built for high-volume production environments, Lara now delivers what was once considered a tradeoff: the fluency and reasoning of an LLM, and the low hallucination of machine translation, both now delivered with near-instant responsiveness.

To achieve this result, Translated co-designed a new hardware solution with Lenovo, purpose-built for translation, and developed an innovative decoding system to fully leverage the latest chips. Optimized for latency-critical scenarios like live chats, trading, and news, Lara now achieves sub-second P99 latency across the 50 most widely spoken languages. This breakthrough sets a new standard for high-quality, low-latency translation and enables new cost-efficient applications, such as only translating the portion of content needed upfront, while processing the rest on demand. Lara is now 10 to 40 times faster than leading LLMs in translation tasks, while delivering higher quality, making it a perfect fit for modern business workflows.


The Lenovo ThinkSystem server used for Lara.

To obtain this outcome, Lenovo provided ThinkSystem servers powered by NVIDIA’s GPUs, the world’s most advanced processors for AI workloads. Each server supports eight of the latest high-speed, interconnected GPUs, powering advancements in AI, including large language models, machine learning, model training, and high-performance computing. Through intense co-design work, Translated and Lenovo were able to optimize their architecture for the translation task. ThinkSystem servers were installed in two data centers in Washington and California, strategically positioned near major internet hubs to keep network latency between Lara and the main internet backbones under one millisecond.

"AI only works when it solves problems in real scenarios, with the speed required to support business at scale. We reached this milestone thanks to a partner that worked with us in the same way we work with our clients, by sharing goals, committing fully, and building for long-term impact. This type of partnership makes innovation possible".
Marco Trombetti – founder and CEO of Translated

To further enhance system responsiveness, Translated’s engineering team designed a new architecture, an industry first for translation AI. It combines the strengths of traditional machine translation and generative AI. This unique approach enables parallelized, context-aware generation of translations, significantly accelerating response time without compromising quality.

As part of their long-term collaboration, the two companies have also signed an agreement to implement liquid-cooling systems across Translated’s infrastructure. This will reduce energy consumption and allow for greater machine density, supporting more sustainable and scalable AI operations.

"Our advanced technology, combined with Translated's vision, has allowed us to achieve unprecedented speed and quality in the language industry. Lenovo ThinkSystem solutions represent the ultimate in AI performance, delivering powerful, reliable infrastructure for mission-critical applications. This partnership is an example of how AI can transform the way people communicate globally, offering faster and more accurate translations for an increasingly connected world".
Alessandro de Bartolo – GM, Italy, Infrastructure Solution Group, Lenovo

Enjoy the World’s
Most Reliable Translator

Experience Lara, a breakthrough translation AI that outperforms popular machine translation and approaches the quality of top-tier professional translators.

Try Lara Now