The increasing demand for faster computing and higher storage capacity has increased energy consumption and heat generation in datacenters. As heat generation grows, cooling requirements have become a critical concern, both for their rising operating costs and for their environmental and societal impacts. Current thermal management techniques thermally profile datacenters and control their cooling equipment to increase its efficiency. In conventional techniques, cooling systems are triggered when the temperature crosses predefined thresholds. Such reactive approaches respond late, because the temperature may already be too high by the time cooling starts, which can degrade hardware performance.

In this work, a proactive control approach is proposed that jointly optimizes the air conditioner compressor duty cycle and fan speed to prevent heat imbalance (the difference between the heat generated by a machine and the heat extracted from it), thus minimizing the cost of cooling. The proposed proactive optimization framework has two objectives: (i) minimize the energy consumption of the cooling system, and (ii) minimize the risk of equipment damage due to overheating. Simulations comparing the proposed proactive, heat-imbalance estimation-based approach against conventional reactive, temperature-based schemes show that the proposed approach is superior in terms of cooling energy, response time, and equipment failure risk.
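The difference between a reactive temperature trigger and a proactive heat-imbalance trigger can be illustrated with a toy example. The sketch below is not the optimization framework described in the abstract: it assumes a single machine modeled as one lumped thermal mass, a fixed baseline heat extraction rate, and hypothetical parameter values (threshold, heat capacity, load profile) chosen only for illustration. The function name `first_cooling_step` and all of its parameters are likewise assumptions.

```python
def heat_imbalance(q_generated_w, q_extracted_w):
    """Heat imbalance in watts: heat generated minus heat extracted."""
    return q_generated_w - q_extracted_w


def first_cooling_step(mode, load_w, base_extract_w=100.0, temp0_c=25.0,
                       threshold_c=30.0, heat_capacity_j_per_c=500.0, dt_s=1.0):
    """Return the first time step at which extra cooling is requested.

    mode="reactive":  request cooling once temperature exceeds threshold_c.
    mode="proactive": request cooling as soon as the heat imbalance is positive.
    All default values are hypothetical, for illustration only.
    """
    temp_c = temp0_c
    for step, q_gen in enumerate(load_w):
        imbalance = heat_imbalance(q_gen, base_extract_w)
        if mode == "proactive" and imbalance > 0:
            return step
        if mode == "reactive" and temp_c > threshold_c:
            return step
        # Lumped thermal model: temperature changes in proportion to imbalance.
        temp_c += imbalance * dt_s / heat_capacity_j_per_c
    return None


# Hypothetical load: balanced for 5 steps, then a jump in generated heat.
load = [100.0] * 5 + [300.0] * 30
proactive_step = first_cooling_step("proactive", load)
reactive_step = first_cooling_step("reactive", load)
print("proactive trigger at step", proactive_step)
print("reactive trigger at step", reactive_step)
```

Under these assumptions the proactive trigger fires at the step where the load jumps, while the reactive trigger waits until the accumulated imbalance has pushed the temperature past the threshold. That lag is the delayed response the abstract attributes to threshold-based schemes.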