
Announcing the Red Hat AI Factory with NVIDIA and Red Hat AI Enterprise

The Red Hat teams have been busy creating software and tools designed to scale AI at pace.

The Red Hat AI Factory with NVIDIA 

The Red Hat AI Factory with NVIDIA is a co-engineered software platform that combines Red Hat AI Enterprise and NVIDIA AI Enterprise to provide an end-to-end AI solution optimised for organisations deploying AI at scale. It delivers the newest AI innovations to enterprise customers whilst also providing Day 0 support for NVIDIA hardware architectures.

Many organisations are looking to shift their strategies towards high-density, agentic AI workflows and address the resulting demands on AI inference and infrastructure. To help them keep pace, Red Hat AI Factory with NVIDIA empowers IT operations teams to streamline management of both traditional infrastructure and the evolving demands of the AI stack, from provisioning the underlying infrastructure to fuelling higher performance for the models and GPUs driving the inference stack. It provides a scalable foundation for AI deployments across any environment, whether on-premises, in the cloud or at the edge.

Chris Wright, chief technology officer and senior vice president, Global Engineering, at Red Hat, comments:

“The shift from AI experimentation to industrial-scale, enterprise-wide production requires a fundamental change in how we manage the AI computing stack. We’re accelerating the path to deploy AI and move quickly to production using Red Hat AI Factory with NVIDIA. With a stable, high-performance foundation driven by our proven hybrid cloud offerings, we’re enabling our customers to own their AI strategy and scale with the same rigour they apply to their core IT platforms.”

Maintaining architectural control from the data centre to the public cloud

The Red Hat AI Factory with NVIDIA provides:

  • Instant access to pre-configured models, including the indemnified IBM Granite family, NVIDIA Nemotron, and NVIDIA Cosmos open models, delivered as NVIDIA NIM microservices. Organisations can further align models to enterprise data using NVIDIA NeMo.
  • Built-in observability, together with Red Hat AI inference capabilities powered by vLLM, NVIDIA TensorRT-LLM, NVIDIA Dynamo, and NVIDIA BlueField, to meet strict AI service level objectives. This helps organisations reduce the total cost of ownership (TCO) for AI by optimising the connection between models and NVIDIA GPUs.
  • A security-hardened foundation for mission-critical AI workloads that require isolation and continuous verification. Built on the flexible and stable foundation of Red Hat Enterprise Linux, it offers advanced security and compliance capabilities from the start, helping to lower risk, save time and mitigate downtime.

Availability: Red Hat AI Factory with NVIDIA is available now.

Red Hat AI Enterprise 

Red Hat AI Enterprise is an integrated AI platform for deploying and managing AI models, agents and applications across the hybrid cloud. It joins the Red Hat AI portfolio, which includes Red Hat AI Inference Server, Red Hat OpenShift AI and Red Hat Enterprise Linux AI.

Red Hat is introducing Red Hat AI 3.3, bringing updates and enhancements across the company’s entire AI portfolio. The Red Hat solutions integrate underlying Linux and Kubernetes infrastructure with advanced agentic capabilities to help organisations move from fragmented experimentation to governed, autonomous operations.

Joe Fernandes, vice president and general manager, AI Business Unit, Red Hat, comments:

“For AI to deliver true business value, it must be operationalised as a core component of the enterprise software stack, not as a standalone silo. Red Hat AI Enterprise is designed to bridge the gap between infrastructure and innovation by providing a unified metal to agent platform. By integrating advanced tuning and agentic capabilities with the industry-leading foundation of Red Hat Enterprise Linux and Red Hat OpenShift, we are providing the complete stack – from the GPU-accelerated hardware to the models and agents that drive business logic. Additionally, with Red Hat AI 3.3 organisations can move beyond fragmented pilots to governed, repeatable and high-performance AI operations across the hybrid cloud.”

Helping Enterprises Move Beyond the “Pilot Phase”

Organisations are increasingly reported to be stuck in the “pilot phase” because of fragmented tools and inconsistent infrastructure.

Jeremy Foster, senior vice president and general manager at Cisco Compute, describes this as the experimentation phase, stating:

“Cisco is focused on helping customers move AI from experimentation to production, securely, at scale, and across hybrid and multi-cloud environments. By supporting Red Hat AI Enterprise and the Red Hat AI Factory with NVIDIA, Cisco enables customers to deploy and operate AI on a consistent, enterprise-grade infrastructure foundation, from data centre to edge. Together, we’re giving customers a simpler, more reliable way to run AI as a mission-critical workload, with the performance, security, and operational control they expect from their core infrastructure.”

Red Hat AI Enterprise addresses this by unifying the model and application lifecycles, allowing IT teams to manage AI as a standardised enterprise system rather than a siloed project – making AI delivery as reliable and repeatable as traditional enterprise software.

Red Hat AI Enterprise is the essential hybrid cloud foundation for the new Red Hat AI Factory with NVIDIA, combining with NVIDIA’s accelerated computing software to accelerate production AI for enterprises.

Discussing enterprise AI, Vlad Rozanovich, senior vice president, Infrastructure Solutions Group, at Lenovo, states:

“The next era of enterprise AI is about real-time action and tangible business return, and that requires an industrial-strength, hybrid foundation. We can bring a scalable, enterprise-grade platform that combines Lenovo’s inferencing-optimised infrastructure with offerings like Red Hat AI Enterprise and the Red Hat AI Factory with NVIDIA, to give customers the real-time advantage – a resilient foundation for agentic AI that is deployable and manageable anywhere they operate.”

Key benefits of Red Hat AI Enterprise include:

  • Faster, more cost-effective and scalable AI inference using the vLLM inference engine and llm-d distributed inference framework for optimised generative AI model deployments across hybrid hardware environments.
  • Integrated observability and lifecycle management to help drive AI lifecycle governance and mitigate risk with an integrated, tested and interoperable enterprise-ready AI stack.
  • Flexibility across the hybrid cloud by empowering organisations to deploy and manage AI models, agents and applications with greater consistency wherever their business needs to run, backed by trusted Red Hat platforms.

Red Hat AI 3.3 adds new features and enhancements, including an expanded Red Hat AI model ecosystem with validated, production-ready compressed versions of Mistral-Large-3, Nemotron-Nano and Apertus-8B-Instruct, available via the OpenShift AI Catalog.

Additionally, the release enables deployment of state-of-the-art models like Ministral 3 and DeepSeek-V3.2 with sparse attention, while delivering multimodal enhancements including a 3x Whisper speedup, geospatial support, improved EAGLE speculative decoding and enhanced tool calling for agentic workflows. IT teams can provide self-service access to privately hosted models via an API gateway. The platform has also expanded its hardware certification to cover NVIDIA’s Blackwell architecture (B300) and added support for AMD MI325X accelerators.

Excitement from the Channel

Francisco Criado, senior vice president, Cloud, Security and AI, TD SYNNEX, comments:

“As a leading end-to-end distributor and a long-standing partner to both Red Hat and NVIDIA, TD SYNNEX is excited to bring the Red Hat AI Factory with NVIDIA to our channel partners and their customers, as a complementary addition to the TD SYNNEX Destination AI program. This optimised, enterprise-grade solution removes the complexity of building and deploying AI, helping organisations operationalise their AI investments across the hybrid cloud and accelerate their journey to real business value.”

Trish Stevens Head of Content
Trish is the Head of Content for In the Channel Media Group.
