CMPF104: Data Cleaning and Preprocessing: Data Science and Data Analytics: Programming For Foundation In Engineering, Assignment, UNITEN, Malaysia
University | Universiti Tenaga Nasional (UNITEN) |
Subject | CMPF104: Programming For Foundation In Engineering |
Data Science and Data Analytics
Download the dataset from BRIGHTEN. If your student ID ends with an odd number, select the Concrete_Data_A dataset; if it ends with an even number, select the Concrete_Data_B dataset. Use Python attributes, functions and libraries to solve the following problems.
a) Data Cleaning and Preprocessing:
- Use Pandas to load the dataset. Name the dataframe as concrete_df_XXX.
- Remove the ‘Number’ column using the .drop() function and display the first ten (10) rows of the data.
- Handle any missing values by dropping or replacing the empty cells. Check for missing values using functions like .info() or .isnull().sum().
- Convert the data frame to array, using to_numpy() function.
- Divide the data into two sets of data with division of 80% and 20% for train and test data, respectively. Name the dataset as train_data_XXX and test_data_XXX
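The cleaning steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not a model answer: the column names and values are invented stand-ins (the real dataset would be loaded with something like pd.read_csv, whose file name and format are not given here), replacing missing cells with the column median is only one of the two options the brief allows, and shuffling before the 80/20 split is a choice the brief does not specify.

```python
import numpy as np
import pandas as pd

# Stand-in data with hypothetical column names. With the real file this
# would be something like pd.read_csv("Concrete_Data_A.csv"); the file
# name and format are assumptions.
rng = np.random.default_rng(0)
concrete_df_XXX = pd.DataFrame({
    "Number": np.arange(1, 21),
    "Cement": rng.uniform(100, 500, 20),
    "Water": rng.uniform(120, 250, 20),
    "Strength": rng.uniform(10, 80, 20),
})
concrete_df_XXX.loc[3, "Water"] = np.nan  # inject one missing value

# Remove the 'Number' column and show the first ten rows.
concrete_df_XXX = concrete_df_XXX.drop(columns=["Number"])
print(concrete_df_XXX.head(10))

# Count missing values per column, then replace them with the column
# median (dropna() would drop the affected rows instead).
print(concrete_df_XXX.isnull().sum())
concrete_df_XXX = concrete_df_XXX.fillna(concrete_df_XXX.median())

# Convert the dataframe to a NumPy array.
data = concrete_df_XXX.to_numpy()

# Shuffle the rows, then split 80% / 20% into train and test sets.
shuffled = rng.permutation(data)
split = int(0.8 * len(shuffled))
train_data_XXX = shuffled[:split]
test_data_XXX = shuffled[split:]
print(train_data_XXX.shape, test_data_XXX.shape)
```

If the row order matters (for example, for the line plots in part c), the shuffle can be skipped and the array sliced directly.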
b) Data Analysis:
- Calculate the correlation between the variables in the dataframe.
- Utilize NumPy and Pandas to calculate summary statistics of the data such as maximum, minimum, standard deviation, average, median and mode of each category.
- Use Pandas functions like .describe() for an overview of summary statistics and apply NumPy functions for specific calculations.
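One way to approach the analysis steps is sketched below, again on a small invented dataframe with hypothetical column names. Note that np.std computes the population standard deviation by default, while .describe() reports the sample standard deviation, so ddof=1 is passed here to make the two agree. NumPy has no built-in mode function, so the mode is taken from Pandas.

```python
import numpy as np
import pandas as pd

# Small stand-in dataframe; column names and values are hypothetical.
df = pd.DataFrame({
    "Cement": [300.0, 250.0, 300.0, 400.0, 350.0],
    "Water": [180.0, 200.0, 180.0, 160.0, 170.0],
    "Strength": [40.0, 30.0, 40.0, 60.0, 50.0],
})

# Pairwise Pearson correlation between all columns.
corr = df.corr()
print(corr)

# Overview of summary statistics from Pandas.
print(df.describe())

# Specific statistics per column with NumPy (axis=0 works column-wise).
values = df.to_numpy()
summary = pd.DataFrame({
    "max": np.max(values, axis=0),
    "min": np.min(values, axis=0),
    "std": np.std(values, axis=0, ddof=1),  # ddof=1 matches describe()
    "mean": np.mean(values, axis=0),
    "median": np.median(values, axis=0),
    # First mode of each column, taken from Pandas.
    "mode": df.mode().iloc[0].to_numpy(),
}, index=df.columns)
print(summary)
```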
c) Visualization:
- Use Matplotlib to create visualizations such as line plots for train and test data across all categories.
- Generate histogram plots and box plots for all variables.
- Ensure that the visualizations are clear, informative, and aesthetically pleasing.
- Customize your plots by adding the titles, labels and legends
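The plotting requirements above might be met along these lines. This is a sketch on randomly generated stand-in data: the column names, the unshuffled 40/10 train/test split, and the output file name are all assumptions made for illustration. The Agg backend is selected so the script runs without a display; remove that line and call plt.show() to view the figure interactively.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; remove to use plt.show()
import matplotlib.pyplot as plt
import numpy as np

# Stand-in data: 50 rows, 3 hypothetical columns, split 80/20 in order.
columns = ["Cement", "Water", "Strength"]
rng = np.random.default_rng(1)
data = rng.uniform([100, 120, 10], [500, 250, 80], size=(50, 3))
train, test = data[:40], data[40:]

# One column of subplots per variable: line plot, histogram, box plot.
fig, axes = plt.subplots(3, len(columns), figsize=(12, 9))
for j, name in enumerate(columns):
    # Line plot: train samples first, test samples continue the x-axis.
    axes[0, j].plot(np.arange(len(train)), train[:, j], label="train")
    axes[0, j].plot(np.arange(len(train), len(data)), test[:, j], label="test")
    axes[0, j].set_title(f"{name}: train vs test")
    axes[0, j].set_xlabel("Sample index")
    axes[0, j].set_ylabel(name)
    axes[0, j].legend()

    # Histogram of the full column.
    axes[1, j].hist(data[:, j], bins=10, edgecolor="black")
    axes[1, j].set_title(f"{name}: histogram")
    axes[1, j].set_xlabel(name)
    axes[1, j].set_ylabel("Frequency")

    # Box plot of the full column.
    axes[2, j].boxplot(data[:, j])
    axes[2, j].set_title(f"{name}: box plot")
    axes[2, j].set_ylabel(name)

fig.tight_layout()
fig.savefig("concrete_plots.png")  # hypothetical output file name
```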