CMPF104: Data Cleaning and Preprocessing: Data science and Data Anaytics: Programming For Foundation In Engineering, Assignment, UNITEN, Malaysia
University | Universiti Tenaga Nasional (UNITEN) |
Subject | CMPF104: Programming For Foundation In Engineering |
Data science and Data Anaytics
Download the dataset from BRIGHTEN. If your student ID ends with an odd number, select Concrete_Data_A dataset, and if your student ID ends with an even number, select Concrete_Data_B dataset. Using the Python attributes, function and libraries to solve the following problems.
a) Data Cleaning and Preprocessing:
- Â Use Pandas to load the dataset. Name the dataframe as concrete_df_XXX.
- Remove ‘Number’ column using .drop() function and visualize the first ten (10)
rows of the data. - Handle any missing values by dropping or replacing the empty cells. Check for missing values using functions like .info() or .isnull().sum()
- Convert the data frame to array, using to_numpy() function.
- Divide the data into two sets of data with division of 80% and 20% for train and test data, respectively. Name the dataset as train_data_XXX and test_data_XXX
Get Solution of this Assessment. Hire Experts to solve this assignment for you Before Deadline.
b) Data Analysis:
- Calculate the correlation between the variables in the dataframe.
- Utilize NumPy and Pandas to calculate summary statistics of the data such as
maximum, minimum, standard deviation, average, median and mode of each
category. - Â Use Pandas functions like .describe() for an overview of summary statistics and apply NumPy functions for specific calculations.
c) Visualization:
- Â Use Matplotlib to create visualizations such as line plots for train and test data
across all categories. - Generate histogram plots and box plots for all variables.
- Ensure that the visualizations are clear, informative, and aesthetically pleasing.
- Customize your plots by adding the titles, labels and legends
Stuck in Completing this Assignment and feeling stressed ? Take our Private Writing Services.
Get Help By Expert
Do you need assistance with CMPF104: Programming For Foundation In Engineering assignments? Our assignment helper in Malaysia offers expert help. We specialize in programming assignment writing to ensure your academic success. Let us handle your coursework while you focus on learning. Invest in your education with our reliable services for top-notch quality and improved performance.
Answer
Recent Solved Questions
- JGB 22203: Applied Statistics Assignment, UniKL, Malaysia You are required to do a survey regarding the height of students
- BBMP1103: Given a matrix W, X and Y as below: a) The order of matrix W, X and Y: Mathematics For Management Assignment, OUM, Malaysia
- Business and Law Assignment, TU, Malaysia To deliver 200 bags of brown rice at the price of RM 4000 to Tinggi Mini Market
- SOPA1035: Social Policy and Social Pedagogies Case Study, UOG, Malaysia Cikgu Aminah teaches a preschool class in SK Tanah Abang, Mersing. She hails from the west coast of peninsular Malaysia
- Public Law Course Work, UiTM, Malaysia In response to the judicial outcome of the Afghan Hijackers case, then-future Prime Minister of the United Kingdom
- CSC1212: Data Communications and Networking Assignment, MMU, Malaysia You are interested in starting your own Gaming Store, The New-Gamers, in a suburban area of your town
- MBP2143 Project Risk, Procurement and Integration Management Report Malaysia
- AC4012: Introduction to Financial Accounting Report, SC, Malaysia You are required to choose TWO listed companies from Bursa Malaysia with the annual report from 2014 onwards
- Pharmacology in Nursing Essay, MSU, Malaysia Discuss combined drug therapy for the management of cancer and the nursing implications when managing patients receiving this therapy
- Microeconomics Assignment, HWU, Malaysia Plastic is a critical component of the modern economy and plays a significant role in various industries