CMPF104: Data Cleaning and Preprocessing: Data science and Data Anaytics: Programming For Foundation In Engineering, Assignment, UNITEN, Malaysia
University | Universiti Tenaga Nasional (UNITEN) |
Subject | CMPF104: Programming For Foundation In Engineering |
Data science and Data Anaytics
Download the dataset from BRIGHTEN. If your student ID ends with an odd number, select Concrete_Data_A dataset, and if your student ID ends with an even number, select Concrete_Data_B dataset. Using the Python attributes, function and libraries to solve the following problems.
a) Data Cleaning and Preprocessing:
- Use Pandas to load the dataset. Name the dataframe as concrete_df_XXX.
- Remove ‘Number’ column using .drop() function and visualize the first ten (10)
rows of the data. - Handle any missing values by dropping or replacing the empty cells. Check for missing values using functions like .info() or .isnull().sum()
- Convert the data frame to array, using to_numpy() function.
- Divide the data into two sets of data with division of 80% and 20% for train and test data, respectively. Name the dataset as train_data_XXX and test_data_XXX
Get Solution of this Assessment. Hire Experts to solve this assignment for you Before Deadline.
b) Data Analysis:
- Calculate the correlation between the variables in the dataframe.
- Utilize NumPy and Pandas to calculate summary statistics of the data such as
maximum, minimum, standard deviation, average, median and mode of each
category. - Use Pandas functions like .describe() for an overview of summary statistics and apply NumPy functions for specific calculations.
c) Visualization:
- Use Matplotlib to create visualizations such as line plots for train and test data
across all categories. - Generate histogram plots and box plots for all variables.
- Ensure that the visualizations are clear, informative, and aesthetically pleasing.
- Customize your plots by adding the titles, labels and legends
Stuck in Completing this Assignment and feeling stressed ? Take our Private Writing Services.
Get Help By Expert
Do you need assistance with CMPF104: Programming For Foundation In Engineering assignments? Our assignment helper in Malaysia offers expert help. We specialize in programming assignment writing to ensure your academic success. Let us handle your coursework while you focus on learning. Invest in your education with our reliable services for top-notch quality and improved performance.
Answer
Recent Solved Questions
- MAT455: Further Calculus for Engineers Assignment, UiTM, Malaysia Plot the region using the mathematical application and Evaluate the given integral
- MGT7998E: The Relationship in Between Service Quality Dimensions and Patient Satisfaction of Public Hospital in Malaysia: RESEARCH Assignment, IIU, Malaysia
- BED15203: Write a C program that prompts user to enter 2 sides of a rectangle and the radius of a circle: Fundamentals Of Programming Assignment, UniKL, Malaysia
- PHYSIOTHERAPY Assignment, MU, Malaysia Mrs. Spoon is a 48-year-old woman who works part-time as a kindergarten assistant come to physiotherapy clinic with the history of Pain
- BBQT1013: Business Mathematics Assignment, CUM, Malaysia: Misuse and abuse of trade discounts infringe on fair trade laws and can cost companies stiff fines and legal fees
- Accounting Essay, UON, Malaysia Tulip Berhad sold goods to Mawar Berhad for RM1.5 million. The margin was 20% on the selling price which was the normal markup
- GMDS5223: The goal of this project is to analyze a real e-commerce dataset to identify patterns: Data Mining Assignment, UiTM, Malaysia
- Food and Beverages at Southwestern University Football Games: Decision Making in Business Case Study UNM, Malaysia
- Human Sciences in Communication Essay, UCISS, Malaysia In today’s global social media platforms such as Facebook, TikTok, YouTube, Twitter, Instagram, WhatsApp, Weibo, WeChat, Wikipedia, web sites, and blogs
- Diploma in Land Survey Assignment, UTM, Malaysia Calculate the volumes of cut and fill contained between consecutive cross-sections from CS10 and CS15