Fabio Lauria

Google DeepMind AI Cooling System: How Artificial Intelligence Revolutionizes Data Center Energy Efficiency

September 14, 2025
Share on social media

Artificial intelligence applied to data center cooling represents one of the most significant innovations in industrial energy optimization.

The autonomous system developed by Google DeepMind, which has been operational since 2018, has demonstrated how AI can transform the thermal management of critical infrastructure, achieving concrete results in terms of operational efficiency.

Innovation Transforming Data Centers.

The Problem of Energy Efficiency

Modern data centers are huge energy consumers, with cooling accounting for about 10 percent of total electricity consumption according to Jonathan Koomey, a global energy efficiency expert. Every five minutes, Google's cloud-based AI system captures a snapshot of the cooling system from thousands of sensors Safety-first AI for autonomous data center cooling and industrial control - Google DeepMind, analyzing operational complexity that defies traditional control methods.

Google's AI cooling system uses deep neural networks to predict the impact of different combinations of actions on future energy consumption, identifying which actions will minimize consumption while meeting robust security constraints DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind.

Concrete and Measurable Results

The results obtained in cooling optimization are significant: the system was able to consistently achieve a 40% reduction in the energy used for cooling DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind. However, considering that cooling accounts for about 10 percent of total consumption, this translates to about 4 percent in overall data center energy savings.

According to Jim Gao's original technical paper, the neural network achieves a mean absolute error of 0.004 and standard deviation of 0.005, equivalent to an error of 0.4% for a PUE of 1.1 DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind.

Where It Works: The Confirmed Data Centers

Verified Implementations

The implementation of the AI system has been officially confirmed in three specific data centers:

Singapore: The first significant deployment in 2016, where the data center uses reclaimed water for cooling Homepage - Google Data Centers and demonstrated 40 percent reduction in cooling energy.

Eemshaven, Netherlands: Data center uses industrial water and consumed 232 million gallons of water in 2023 Homepage - Google Data Centers. Marco Ynema, site lead for the facility, oversees the operations of this advanced facility.

Council Bluffs, Iowa: The MIT Technology Review specifically showcased the Council Bluffs data center during the discussion of the AI system Google Cloud's Data Center Locations: Regions and Availability Zones - Dgtl Infra. Google invested $5 billion in the two Council Bluffs campuses, which consumed 980.1 million gallons of water in 2023 China Powers AI Boom with Undersea Data Centers | Scientific American.

A cloud-based AI control system is now operational and providing energy savings in multiple data centers Google Smart Liquid Cooling: Beating Google on Efficiency | ProphetStor, but the company has not released the full list of facilities using the technology.

Technical Architecture: How It Works

Deep Neural Networks and Machine Learning

According to patent US20180204116A1, the system uses adeep learning architecture with precise technical features:

  • 5 hidden layers with 50 nodes per layer
  • 19 normalized input variables including heat loads, weather conditions, equipment status
  • 184,435 training samples at 5-minute resolution (about 2 years of operational data)
  • Regularization parameter: 0.001 to prevent overfitting

The architecture uses Model Predictive Control with linear ARX models integrated with deep neural networks. Neural networks do not require the user to predefine interactions between variables in the model. Instead, the neural network searches for patterns and interactions between features to automatically generate an optimal model DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind.

Power Usage Effectiveness (PUE): The Key Metric

PUE represents the fundamental energy efficiency of data centers:

PUE = Total Data Center Energy / IT Equipment Energy

  • PUE Google fleet-wide: 1.09 in 2024 (according to Google environmental reports)
  • Industrial average: 1.56-1.58
  • Ideal PUE: 1.0 (theoretically impossible)

Google holds ISO 50001 certification for energy management, which ensures strict operational standards but does not specifically validate AI system performance.

Model Predictive Control (MPC)

At the heart of the innovation is predictive control that predicts future data center temperature and pressure in the next hour, simulating recommended actions to ensure not to exceed operational constraints DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind.

Operational Benefits of AI in Cooling

Superior Predictive Accuracy

After trial and error, the models are now 99.6 percent accurate in predicting PUE Machine Learning Applications for Data Center Optimization. This accuracy enables optimizations impossible with traditional methods, simultaneously handling the complex nonlinear interactions between mechanical, electrical, and environmental systems.

Continuous Learning and Adaptation

One significant aspect is the evolutionary learning capability. Over the course of nine months, system performance increased from a 12 percent improvement at initial launch to about a 30 percent improvement Data Center Optimization Jim Gao, Google - DocsLib.

Dan Fuenffinger, Google operator, noted, "It was amazing to see AI learn how to take advantage of winter conditions and produce colder-than-normal water. The rules don't get better over time, but AI does" Data Center Cooling using Model-predictive Control.

Multi-Variable Optimization

The system handles 19 critical operational parameters simultaneously:

  • Total IT load of servers and networking
  • Weather conditions (temperature, humidity, enthalpy)
  • Equipment status (chillers, cooling towers, pumps)
  • Setpoints and operational controls
  • Fan speeds and VFD systems

Security and Control: Fail-Safe Guaranteed

Multi-Level Verification

Operational safety is ensured through redundant mechanisms. Optimal actions calculated by AI are checked against an internal list of operator-defined security constraints. Once sent to the physical data center, the local control system re-verifies the instructions DeepMind AI reduces energy used for cooling Google data centers by 40 percent.

Operators always maintain control and can exit AI mode at any time, seamlessly transferring to traditional rules DeepMind AI reduces energy used for cooling Google data centers by 40 percent.

Limitations and Methodological Considerations

PUE Metrics and Its Limitations

Industry recognizes the limitations of Power Usage Effectiveness as a metric. A 2014 Uptime Institute survey found that 75 percent of respondents believed the industry needed a new efficiency metric. Problems include climate bias (impossible to compare different climates), time manipulation (measurements during optimal conditions), and component exclusion.

Complexity of Implementation

Each data center has unique architecture and environment. A custom model for one system may not be applicable to another, requiring a general intelligence framework DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind.

Data Quality and Verifications

The accuracy of the model depends on the quality and quantity of the input data. Model error generally increases for PUE values above 1.14 due to the scarcity of corresponding training data DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind.

No independent audits by major audit firms or national laboratories were found, with Google "not pursuing third-party audits" beyond the minimum federal requirements.

The Future: Evolution toward Liquid Cooling

Technology Transition

In 2024-2025, Google has shifted emphasis dramatically to:

  • +/-400 VDC power systems for 1MW racks
  • "Project Deschutes" cooling distribution units
  • Direct liquid cooling for TPU v5p with "99.999% uptime"

This change indicates that AI optimization has reached practical limits for the thermal loads of modern AI applications.

Emerging Trends

  • Edge computing integration: distributed AI for reduced latency
  • Digital twins: Digital twins for advanced simulation
  • Sustainability focus: Optimization for renewable energy
  • Hybrid cooling: AI-optimized liquid/air combination

Applications and Opportunities for Companies

Areas of Application

AI optimization for cooling has extended applications beyond data centers:

  • Industrial plants: Manufacturing HVAC systems optimization
  • Shopping malls: Intelligent climate management
  • Hospitals: Environmental control operating rooms and critical areas
  • Corporate Offices: Smart building and facility management

ROI and Economic Benefits

Energy savings on cooling systems result in:

  • Reduced operating costs of the cooling subsystem
  • Improving environmental sustainability
  • Equipment life extension
  • Increased operational reliability

Strategic Implementation for Companies

Adoption Roadmap

Phase 1 - Assessment: Energy audit and mapping existing systemsPhase2 - Pilot: Testing in controlled environment on limited sectionPhase3 - Deployment: Progressive rollout with intensive monitoringPhase4 - Optimization: Continuous tuning and capacity expansion

Technical Considerations

  • Sensor infrastructure: Comprehensive monitoring network
  • Team skills: data science, facility management, cybersecurity
  • Integration: Compatibility with legacy systems
  • Compliance: Safety and environmental regulations

FAQ - Frequently Asked Questions

1. In which Google data centers is the AI system actually operating?

Three data centers are officially confirmed: Singapore (first deployment 2016), Eemshaven in the Netherlands, and Council Bluffs in Iowa. The system is operational in multiple data centers Google Smart Liquid Cooling: Beating Google on Efficiency | ProphetStor but the full list has never been publicly disclosed.

2. How much energy saving does it really produce on total consumption?

The system achieves a 40% reduction in the energy used for cooling DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind. Considering that cooling accounts for about 10 percent of total consumption, the total energy savings is about 4 percent of total data center consumption.

3. What is the accuracy of the system in forecasting?

The system achieves 99.6% accuracy in PUE prediction with an average absolute error of 0.004 ± 0.005, equivalent to an error of 0.4% for a PUE of 1.1 Google DeepMindGoogleResearch. If the actual PUE is 1.1, the AI predicts between 1.096 and 1.104.

4. How do you ensure operational security?

It uses two-level verification: first the AI checks the security constraints defined by operators, then the local system checks the instructions again. Operators can always turn off AI checking and return to traditional systems DeepMind AI reduces energy used for cooling Google data centers by 40 percent.

5. How long does it take to implement such a system?

Implementation typically takes 6-18 months: 3-6 months for data collection and model training, 2-4 months for pilot testing, 3-8 months for phased deployment. Complexity varies significantly depending on the existing infrastructure.

6. What technical skills are needed?

It takes a multidisciplinary team with expertise in data science/AI, HVAC engineering, facility management, cybersecurity, and system integration. Many companies opt for partnerships with specialized vendors.

7. Can the system adapt to seasonal changes?

Yes, the AI automatically learns to take advantage of seasonal conditions, such as producing cooler water in winter to reduce cooling energy Data Center Cooling using Model-predictive Control. The system continuously improves by recognizing time and weather patterns.

8. Why doesn't Google commercialize this technology?

Each data center has unique architecture and environment, requiring significant customization DeepMind AI Reduces Google Data Centre Cooling Bill by 40% - Google DeepMind. Complexity of implementation, need for specific data, and required skills make direct commercialization complex. After 8 years, this technology remains exclusively internal to Google.

9. Are there independent performance reviews?

No independent audits by major audit firms (Deloitte, PwC, KPMG) or national laboratories were found. Google holds ISO 50001 certification but "does not pursue third-party audits" beyond the minimum federal requirements.

10. Is it applicable to other industries beyond data centers?

Absolutely. AI optimization for cooling can be applied to industrial plants, shopping centers, hospitals, corporate offices, and any facility with complex HVAC systems. The principles of multi-variable optimization and predictive control are universally applicable.

The Google DeepMind AI cooling system represents an engineering innovation that achieves incremental improvements within a specific domain. For companies operating energy-intensive infrastructure, this technology offers real opportunities for cooling optimization, even with the highlighted limitations of scale.

Primary Sources: Jim Gao Google Research paper, DeepMind Official Blog, MIT Technology Review, Patent US20180204116A1

Fabio Lauria

CEO & Founder | Electe

CEO of Electe, I help SMEs make data-driven decisions. I write about artificial intelligence in business.

Most popular
Sign up for the latest news

Receive weekly news and insights in your
inbox. Don't miss it!

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.