Algorithm breaks the exabyte barrier

by Simon Osuji
September 11, 2023
in Artificial Intelligence

Machine learning masters massive data sets
Illustration of distributed HPC hardware and different communication channels. Credit: The Journal of Supercomputing (2023). DOI: 10.1007/s11227-023-05587-4

A machine-learning algorithm has demonstrated that it can process data sets larger than a computer’s available memory by identifying a massive data set’s key features and dividing them into manageable batches that don’t choke the hardware. Developed at Los Alamos National Laboratory, the algorithm set a world record for factorizing huge data sets during a test run on Oak Ridge National Laboratory’s Summit, the world’s fifth-fastest supercomputer.

Equally efficient on laptops and supercomputers, the highly scalable algorithm overcomes hardware bottlenecks that prevent the processing of information from data-rich applications in cancer research, satellite imagery, social media networks, national security science and earthquake research, to name just a few.

“We developed an ‘out-of-memory’ implementation of the non-negative matrix factorization method that allows you to factorize larger data sets than previously possible on a given hardware,” said Ismael Boureima, a computational physicist at Los Alamos National Laboratory. Boureima is first author of the paper in The Journal of Supercomputing on the record-breaking algorithm.

“Our implementation simply breaks down the big data into smaller units that can be processed with the available resources. Consequently, it’s a useful tool for keeping up with exponentially growing data sets.”

“Traditional data analysis demands that data fit within memory constraints. Our approach challenges this notion,” said Manish Bhattarai, a machine learning scientist at Los Alamos and co-author of the paper.

“We have introduced an out-of-memory solution. When the data volume exceeds the available memory, our algorithm breaks it down into smaller segments. It processes these segments one at a time, cycling them in and out of the memory. This technique equips us with the unique ability to manage and analyze extremely large data sets efficiently.”
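The idea of cycling segments in and out of memory can be illustrated with a minimal sketch. This is not the Los Alamos implementation: it assumes the textbook multiplicative-update rules for non-negative matrix factorization, splits the data by rows only, and runs on a single machine. The function name `batched_nmf` and its parameters are hypothetical.

```python
import numpy as np

def batched_nmf(X, k, batch_rows, n_iter=200, seed=0, eps=1e-9):
    """Factor non-negative X (m x n) as W (m x k) @ H (k x n),
    reading only `batch_rows` rows of X at a time.

    Illustrative sketch using standard multiplicative updates;
    not the method from the paper.
    """
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = rng.random((m, k))
    H = rng.random((k, n))
    for _ in range(n_iter):
        HHt = H @ H.T                 # small k x k matrix, always fits
        WtX = np.zeros((k, n))        # accumulators for the H update
        WtW = np.zeros((k, k))
        for start in range(0, m, batch_rows):
            rows = slice(start, start + batch_rows)
            Xb = np.asarray(X[rows])  # only this batch is held in memory
            Wb = W[rows]              # view: in-place edits update W
            # Rows of W update independently once H is fixed.
            Wb *= (Xb @ H.T) / (Wb @ HHt + eps)
            # Accumulate the products that the H update needs.
            WtX += Wb.T @ Xb
            WtW += Wb.T @ Wb
        H *= WtX / (WtW @ H + eps)
    return W, H

# Small demonstration on synthetic, exactly rank-3 non-negative data.
rng = np.random.default_rng(1)
X = rng.random((60, 3)) @ rng.random((3, 40))
W, H = batched_nmf(X, k=3, batch_rows=16)
rel_err = np.linalg.norm(X - W @ H) / np.linalg.norm(X)
print(f"relative reconstruction error: {rel_err:.4f}")
```

Because each row block of `W` depends only on its own rows of `X`, the batched result matches what a single pass over the full matrix would give, up to floating-point rounding. In practice `X` could be a disk-backed array (for example a NumPy `memmap`), so that slicing loads one batch at a time.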

The distributed algorithm, designed for modern, heterogeneous high-performance computing systems, can be useful on hardware as small as a desktop computer or as large and complex as the Chicoma, Summit or upcoming Venado supercomputers, Boureima said.

“The question is no longer whether it is possible to factorize a larger matrix, rather how long is the factorization going to take,” Boureima said.

The Los Alamos implementation takes advantage of hardware features such as GPUs to accelerate computation and fast interconnects to move data efficiently between computers. At the same time, the algorithm performs multiple tasks simultaneously.

This non-negative matrix factorization implementation is another installment of the high-performance algorithms developed under the SmartTensors project at Los Alamos.

In machine learning, non-negative matrix factorization can be used as a form of unsupervised learning to pull meaning from data, Boureima said. “That’s very important for machine learning and data analytics because the algorithm can identify explainable latent features in the data that have a particular meaning to the user.”
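For reference, non-negative matrix factorization is commonly stated as follows. The symbols X, W, H and k are labels chosen here, and the Frobenius-norm objective is the textbook formulation, not necessarily the one the Los Alamos code minimizes.

```latex
X \approx W H, \qquad
X \in \mathbb{R}_{\ge 0}^{m \times n},\;
W \in \mathbb{R}_{\ge 0}^{m \times k},\;
H \in \mathbb{R}_{\ge 0}^{k \times n},\;
k \ll \min(m, n)

\min_{W \ge 0,\; H \ge 0} \; \lVert X - W H \rVert_F^2
```

Each of the k columns of W, paired with the matching row of H, acts as one latent feature. Because no entry can be negative, features can only add to one another rather than cancel, which is a large part of why the resulting parts tend to be interpretable.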

The record-breaking run

In the record-breaking run by the Los Alamos team, the algorithm processed a 340-terabyte dense matrix and an 11-exabyte sparse matrix, using 25,000 GPUs.
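For a rough sense of scale, the dense-matrix figure can be divided across the GPUs. The calculation below assumes decimal terabytes and a perfectly even split, neither of which the article states, and it ignores the extra memory needed for the factor matrices and working buffers.

```python
# Back-of-the-envelope share of the dense matrix per GPU.
# Assumptions (not from the article): 1 TB = 10**12 bytes, even split.
dense_bytes = 340 * 10**12
n_gpus = 25_000
per_gpu_gb = dense_bytes / n_gpus / 10**9
print(f"{per_gpu_gb:.1f} GB per GPU")  # prints "13.6 GB per GPU"
```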

“We’re reaching exabyte factorization, which no one else has done, to our knowledge,” said Boian Alexandrov, a co-author of the new paper and a theoretical physicist at Los Alamos who led the team that developed the SmartTensors artificial intelligence platform.

Decomposing, or factoring, data is a specialized data-mining technique that extracts pertinent information and simplifies the data into understandable formats.

Bhattarai further emphasized the scalability of the algorithm, contrasting it with conventional methods, which he said “often grapple with bottlenecks, mainly due to the lag in data transfer between a computer’s processors and its memory.”

“We also showed you don’t necessarily need big computers,” Boureima said. “Scaling to 25,000 GPUs is great if you can afford it, but our algorithm will be useful on desktop computers for something you couldn’t process before.”

More information:
Ismael Boureima et al, Distributed out-of-memory NMF on CPU/GPU architectures, The Journal of Supercomputing (2023). DOI: 10.1007/s11227-023-05587-4

Provided by
Los Alamos National Laboratory

Citation:
Machine learning masters massive data sets: Algorithm breaks the exabyte barrier (2023, September 11), retrieved 11 September 2023 from https://techxplore.com/news/2023-09-machine-masters-massive-algorithm-exabyte.html





