Big Data Systems and Analytics

Shobha Gangadhar Tilak & Jyoti Shetty

Start:
Ende:

Montag, 24.8. um 10:00 Uhr
Mittwoch, 26.8. um 14:15 Uhr

Unterrichtssprache: Englisch

Kursbeschreibung:

This course provides an introduction to the core principles of big data systems and analytics, with a particular emphasis on handling extensive datasets in a distributed setting. The curriculum highlights distributed computing models such as Hadoop and HPCC Systems, covering aspects like block storage, file systems, Map-Reduce Jobs, and the CAP Theorem. Students will gain a deep understanding of batch processing, in-memory distributed processing, and stream processing. Furthermore, the course explores the architectures and functionalities of pivotal components within the Hadoop ecosystem, including Flume, Sqoop, HBase, Hive, and Pig, tailored for both structured and unstructured data analytics. Practical demonstrations will illustrate how these tools can be effectively utilized for comprehensive data analysis in a distributed environment. Participants will also delve into applying these concepts to real-world problems.

Voraussetzungen:

Ein Laptop mit installiertem Hadoop und Hive. Unterstützung wird im Rahmen des vorgesehenen E-Mail-Austauschs mit den Teilnehmenden bereitgestellt, indem Links zum Herunterladen und Installieren der Software zur Verfügung gestellt werden.

Biographie: Shobha Gangadhar Tilak

Shobha G, Professor, Dean, School of Computer Science & Engineering, RV University, India has teaching experience of 31 years, her specialization includes Data mining, Machine Learning and Image processing. She has published more than 150 papers in reputed journals / conferences. She has also executed sponsored projects worth INR 300 lakhs funded from various agencies nationally and internationally. She is a recipient of various awards such as Career Award for young teachers 2007-08 constituted by All India Council of Technical Education, Best Researcher award from Cognizant 2017, GHC Faculty Scholar for Women in Computing in 2018, IBM Shared University Research Award in 2019, HPCC Systems community recognition award 2020, HPCC Mentorship award 2021 and Pass it on Award 2023. She is also an advisory committee member for IET India Scholarship Award 2021 to till date.

Biographie: Jyoti Shetty

Dr. Jyoti Shetty is Associate Professor at the Computer Science and Engineering Department at RV College of Engineering in Bengaluru, India. She has 18 years teaching and 2 year industry experience. Her specialization includes Data Mining, Machine Learning, Artificial Intelligence. She has published 50+ research papers in reputed journals and conferences. She has also executed sponsored projects funded from various agencies nationally and internationally. She had delivered expert talk at various IT Industry. She was the recipient of awards such as SAP Award of excellence from IIT Bombay for demonstrating ICT in education in 2016 and HPCC Systems Mentor Badge Award in 2021 for providing guidance and direction towards the successful completion of intern open source projects, and Best paper presentation award at ICBDA 2023 Conference. She is recipient of Best Young Teacher Award from RVCE, ISTE Chapter, September, 2024, and HPCC Systems Community Recognition Award, Oct 2024.