CMPF104: Data Cleaning and Preprocessing: Data science and Data Anaytics: Programming For Foundation In Engineering, Assignment, UNITEN, Malaysia
| University | Universiti Tenaga Nasional (UNITEN) |
| Subject | CMPF104: Programming For Foundation In Engineering |
Data science and Data Anaytics
Download the dataset from BRIGHTEN. If your student ID ends with an odd number, select Concrete_Data_A dataset, and if your student ID ends with an even number, select Concrete_Data_B dataset. Using the Python attributes, function and libraries to solve the following problems.
a) Data Cleaning and Preprocessing:
- Use Pandas to load the dataset. Name the dataframe as concrete_df_XXX.
- Remove ‘Number’ column using .drop() function and visualize the first ten (10)
rows of the data. - Handle any missing values by dropping or replacing the empty cells. Check for missing values using functions like .info() or .isnull().sum()
- Convert the data frame to array, using to_numpy() function.
- Divide the data into two sets of data with division of 80% and 20% for train and test data, respectively. Name the dataset as train_data_XXX and test_data_XXX
Get Solution of this Assessment. Hire Experts to solve this assignment for you Before Deadline.
b) Data Analysis:
- Calculate the correlation between the variables in the dataframe.
- Utilize NumPy and Pandas to calculate summary statistics of the data such as
maximum, minimum, standard deviation, average, median and mode of each
category. - Use Pandas functions like .describe() for an overview of summary statistics and apply NumPy functions for specific calculations.
c) Visualization:
- Use Matplotlib to create visualizations such as line plots for train and test data
across all categories. - Generate histogram plots and box plots for all variables.
- Ensure that the visualizations are clear, informative, and aesthetically pleasing.
- Customize your plots by adding the titles, labels and legends
Stuck in Completing this Assignment and feeling stressed ? Take our Private Writing Services.
Get Help By Expert
Do you need assistance with CMPF104: Programming For Foundation In Engineering assignments? Our assignment helper in Malaysia offers expert help. We specialize in programming assignment writing to ensure your academic success. Let us handle your coursework while you focus on learning. Invest in your education with our reliable services for top-notch quality and improved performance.
Answer
Recent Solved Questions
- Master of Policy Sciences Assignment, CU, Malaysia Students are required to extract data on the performance of a country in terms of sustainability policies and management practices
- Fundamentals of Finance Assignment, MUM, Malaysia PYG Berhad inventory is held on average for 50 days, account receivable is collected in 20 days, and account payable
- SQQS5073 Structural Equation Modelling Individual Assignment 1 Universiti Utara Malaysia
- STA404: Statistics for Business and Social Sciences Assignment, UiTM, Malaysia Healthcare facilities are essential for providing quality and accessible health services to the population
- F79MA: Statistical Model Assignment, HWU, Malaysia Suppose that you are a trainee actuary working in the mathematical modeling team for a non-governmental organization that is rolling out a micro-credit scheme to support rural communities in developing countries
- MGT555: Business Analytics Assignment, UiTM, Malaysia Calculate the frequency distribution of customer demographics such as age, gender, race, education, and occupation
- BBCA1033 Introduction to Accounting Assignment, CUM, Malaysia
- FINAL ASSESSMENT SEMESTER III SESI Universiti Teknologi Malaysia : PREVENTING ACCIDENT AT WORKPLACE
- HR Management Report, OUM, Malaysia Sedap Fried Chicken Bhd is a new and rapid growing fast-food company in Malaysia
- FIN10004: Customers in the Corporate segment are often more likely to place larger orders than their consumer counterpart: Financial Statistic, Report, UTM, Malaysia