3. Megadata storage and management
In this section, we look at the problem of storing very large volumes of data for analysis. First, we point out the limitations of relational databases and discuss the benefits of cloud computing. We then highlight the advantages, for data storage and management, of the MapReduce parallel programming model and of Hadoop, the free (open-source) framework that implements it. Next, we introduce the various NoSQL database models, which represent different solutions for storing megadata. Finally, we look at a few alternatives, including NewSQL databases.
3.1 Relational databases
This article is included in
Industry of the future
Websites
Mahout https://mahout.apache.org
BERTopic https://maartengr.github.io/BERTopic
Gargantext https://gargantext.org