ICLR Paper: Learn Step Size Quantization @ibmresearch

IBM Research | ICLR Paper: Learn Step Size Quantization @ibmresearch | Uploaded June 2020 | Updated October 2024, 4 days ago.
As deep networks are increasingly deployed in memory-constrained and throughput-critical systems, there is a need to create AI models that can maintain accuracy – and, as a result, trust – while also consuming fewer resources. Researchers at IBM’s Almaden Research Laboratory have reached a new milestone in AI precision and developed an algorithm that matches the inference accuracy of a 32-bit network while using only three bits.

The researchers achieved this level of energy efficiency using a new process called “learned step size quantization,” which improves parameter change estimates in a low-precision network during training, to produce better performance. The research also uncovered evidence that AI systems seeking to optimize performance on a given system might run with as few as 2 bits. This advance means AI systems are steadily coming closer to the low levels of energy consumed by the human brain, while maintaining performance.

In-memory physical superposition meets few-shot continual learning

The Short: AI chips come to UAlbany, protecting AI models from attack, and tiny magnetic molecules

Capturing and transforming CO2 to mitigate climate change

$Whats Next in AI : AI We Can Trust Tune in to our multi-stream event spanning three weeks in November and featuring the AI scientists developing AI technologies for real-world implementation. Leaders agree Artificial Intelligence technologies are key to competitive advantage in the 21stCentury, but only a fraction of organizations are successfully executing in AI. In this virtual event, scientists and business experts from the MIT-IBM Watson AI Lab come together to review three key barriers to adoption –trust, scalability, and reasoning –and how we can solve these challenges through scientific advancement. 11/05 AI We Can Trust 11/12 AI We Can Scale 11/19 AI We Can Reason With Here are the details from the discussion in this video 9:05 Welcome and Introduction Mark Weber 9:15 AI Science for Real-World Impact Aude Oliva 9:30 Business Discussion Mark Weber, Aude Oliva, David Cox 9:45 Certifying Robustness Lily Weng 10:00 Business Discussion Mark Weber, Lily Weng, and David Cox 10:15 Safe AI Armando Solar-Lezama 10:30 Business Discussion Mark Weber and Armando Solar-Lezama 10:45 Algorithmic Fairness Mikhail Yurochkin 11:00 Business Discussion Mark Weber, Mikhail Yurochkin, and Sherif Botros 11:15 Identifying unreliable predictions in clinical risk models Collin Stultz 11:30 Business Discussion Mark Weber, Collin Stultz, Jianying Hu and Ron Lancaster 11:45 Closing thoughts and preview of the next session Mark Weber, Aude Oliva, David Cox$

Understanding the NIST standards and IBMs contributions to post-quantum cryptography

AI and fully homomorphic encryption at IBM Research - Haifa

Four years of quantum computing on the IBM Cloud

The Short: Tiny benchmarks for LLMs, upending automation with gen AI and remembering Bob Dennard

Laying the Groundwork for Quantum Powered Use Cases

Effects of qubit frequency crowding on scalable quantum processors*