4 hours of instruction
A theoretical course covering topics on how to handle data at scale and the different tools needed for distributed data storage, analysis, and management. Learners will be able to dive into the vast world of data and computing at scale and get a comprehensive overview of distributed computing.
OBJECTIVES
- Explore the big data ecosystem and explore tools and methodologies needed for distributed data storage and big data analysis
PREREQUISITES
Optimizing Ensemble Methods
SYLLABUS & TOPICS COVERED
- Intro To Big Data
- Data at scale
- Major sources of big data and industries that deal with it on daily basis
- Distributed Data Storage And Analysis
- Need for distributed data storage
- Scalability, fault tolerance, and reliability
- Tools for distributed data storage
SOFTWARE REQUIREMENTS
TBD
Login
Accessing this course requires a login. Please enter your credentials below!