WELCOME TO RICHFIELD DISTANCE LEARNING

Published on
Embed video
Share video
Ask about this video

Scene 1 (0s)

[Audio] WELCOME TO RICHFIELD DISTANCE LEARNING BIG DATA & INTERNET OF THINGS 600.

Scene 2 (7s)

[Audio] PRESCRIBED OR RECOMMENDED BOOKS Prescribed Textbook: Geng, H. (2017).Internet of Things and Data Analytics Handbook. United States: Wiley. ISBN:9781119173649 Recommended Reading: Big Data, IOT and Machine Learning Tools and applications ISBN:9780367336745.

Scene 3 (47s)

[Audio] BIG DATA & IOT 600 Chapter 1: Big Data Analysis and Extraction Techniques Chapter 2: IoT Architectures and Applications Chapter 3: IoT technologies and Standards Chapter 4: Big Data Storage and Security Chapter 5: Big Data ethics, standards and policies.

Scene 4 (1m 14s)

[Audio] Chapter 1 Big Data Analysis and Extraction Techniques.

Scene 5 (1m 21s)

[Audio] Big Data is the term used to describe the application of specialized techniques to process very large sets of data, which are often too complex to process using regular database management tools. The challenges that big data solves include capture, storage, search, sharing, transfer, analysis, and visualization..

Scene 6 (1m 44s)

[Audio] Main characteristics of big data Volume Velocity Variety Veracity Value.

Scene 7 (1m 52s)

[Audio] Volume (Scale) Data Volume 44x increase from 2009 to 2020 From 0.8 zettabytes to 35zb Data volume is increasing exponentially Exponential increase in collected/generated data.

Scene 8 (2m 11s)

[Audio] mobile-internet TrendsMontage 2+ billion people on the Web by end 2011 30 billion RFID tags today (1.3B in 2005) 4.6 billion camera phones world wide 100s of millions of GPS enabled devices sold annually 76 million smart meters in 2009… 200M by 2014 12+ TBs of tweet data every day ? TBs of data every day 25+ TBs of log data every day.

Scene 9 (2m 51s)

[Audio] Variety (Complexity) Relational Data (Tables/Transaction/Legacy Data) Text Data (Web) Semi-structured Data (XML) Graph Data Social Network, Semantic Web (RDF), … Streaming Data You can only scan the data once A single application can be generating/collecting many types of data Big Public Data (online, weather, finance, etc) To extract knowledge all these types of data need to linked together.

Scene 10 (3m 23s)

[Audio] Velocity (Speed) Data is being generated fast and need to be processed fast Online Data Analytics Late decisions  missing opportunities Examples E-Promotions: Based on your current location, your purchase history, what you like  send promotions right now for store next to you Healthcare monitoring: sensors monitoring your activities and body  any abnormal measurements require immediate reaction.

Scene 11 (3m 51s)

[Audio] Veracity Data veracity, in general, is how accurate or truthful a data set may be. In the context of big data, however, it takes on a bit more meaning. More specifically, when it comes to the accuracy of big data, it’s not just the quality of the data itself but how trustworthy the data source, type, and processing of it is..

Scene 12 (4m 15s)

[Audio] Value Insight discovery and pattern recognition that lead to more effective operations, stronger customer relationships and other clear and quantifiable business benefits..

Scene 13 (4m 27s)

[Audio] Sources of Big data Business Transaction Social Media Platforms Internet Of Things Web and digital activity Medical Records Scientific Research.

Scene 14 (4m 37s)

[Audio] Common tools and frameworks used for big data analysis. Hadoop Apache Spark Apache Kafka MongoDB Tableau Amazon Web Services (AWS) Apache Flink Power BI.

Scene 15 (4m 52s)

[Audio] Big Data Analysis Techniques: Descriptive analysis Predictive analysis Deep learning and Machine Learning (ML) Real-time analytics Sentiment analysis.

Scene 16 (5m 4s)

[Audio] Data Cleaning Finding and fixing mistakes, inconsistencies, and missing values in raw data is the goal of data cleaning..

Scene 17 (5m 14s)

[Audio] Data Normalization The goal is to standardize and scale data in order to facilitate efficient processing and comparison..

Scene 18 (5m 24s)

[Audio] Data Transformations The goal of data transformation is to transform unprocessed data into a format that may be used for analysis or model construction..

Scene 19 (5m 35s)

[Audio] Big Data Challenges Complexity: Data in different formats and structures (such as structured, semi-structured, and unstructured) must be handled by transformation pipelines. Real-Time Constraints: Workflows using streaming data require quick transformations. Resource Management: To reduce memory and computation consumption, large-scale transformations need to be optimized..

Scene 20 (6m 3s)

[Audio] Review questions What is Big Data, and what are its defining characteristics (e.g., the 3Vs or 5Vs)? Explain the difference between structured, unstructured, and semi-structured data. Describe how Big Data differs from traditional data management systems in terms of scale and processing. List at least three tools or frameworks commonly used for Big Data processing and describe their primary purposes. Compare Hadoop and Spark in terms of data processing capabilities and use cases. What is the role of NoSQL databases (e.g., MongoDB, Cassandra) in Big Data environments?.

Scene 21 (6m 46s)

[Audio] All the best with your studies!!!. All the best with your studies!!!.