Data Analysis is a part of Statistics.
Data Analysis is also a part of Computer Science.
Hence, it is expected that any Statistics and/or Computer Science student should know the descriptive statistics/analysis
of data.
This project is designed to measure your knowledge and understanding of data analysis by describing a raw dataset, presenting the data, calculating the
descriptive statistics of the data, and interpreting your results.
(1.) This is an individual project. It is not a group project.
Students may work together. However, each student must submit his or her own project.
(2.) Each student will work with real-world raw data of a variable.
Please identify the variable and the type of variable.
Specify the year when the raw data was obtained.
All information used for this project should be verifiable on the direct website of the
company/organization/government.
(I.) The sample size of the real-world data should be at least ten (10)
(II.) Textbook examples/exercises are NOT allowed.
You may use any of these datasets.
(a.) MyLab Math (MLM) Datasets: You may use any of the applicable datasets from your MLM assignments if the sample size is at least 10 (n ≥ 10).
(b.) Datasets from the U.S. Government website: United States Government's Open Data: Datasets
You may use any of the applicable open datasets from the U.S. government.
OR
(c.) You can browse/search for real-world data/datasets on the United States Government website: Datasets (https://catalog.data.gov/dataset)
(d.) You may use any other real-world dataset of raw data that you find from any company or any academic institution
that gives you the right to use their raw dataset.
(e.) You may also conduct your own research and collect raw data.
We have covered Data Collection.
If you decide to conduct your own research, please email me for pre-approval.
If you cannot find any raw data after trying the aforementioned ways, please let me know.
(3.) Write the complete address of the direct page of the website where you found the data.
*For traditional (onsite) students only: if the "direct" web address is too long, please shorten it by pasting the "complete web address" into www.tinyurl.com
Generate a short address and write that address as is.*
For online students, please copy and paste the link as is.
Please set the link to open in a new window.
(4.) I understand some of you do not want to type directly in the Blackboard editor using the Math Editor.
You may:
(a.) Show all your work on paper, write all math terms appropriately, take clear screenshots of your entire work, and insert those screenshots
as images directly on the Blackboard editor. OR
(b.) Show all your work in Microsoft Word, write all math terms appropriately, take clear screenshots of your entire work, and insert those screenshots
as images directly on the Blackboard editor.
Please DO NOT submit any attachment on Blackboard. It will not be clicked. It will not be opened. It will not be graded.
(5.) If you need feedback on your project before submission, please submit your project as a draft in the Project Drafts forum on Blackboard.
You may also send it to me as an attachment via email. I shall review and provide feedback.
Draft projects are to be submitted in the Project Drafts forum. Draft projects are not graded.
Actual projects that need to be graded should be submitted in the actual Project forum.
(6.) Research Skills: Cite your source properly. Use APA, MLA, or Chicago Manual of Style. Indicate the style
you used.
(7.) Writing Skills: Write or type the "raw" data entirely.
(8.) Mathematical Skills (For Only Statistics Students):
(a.) Data Presentation: Use an appropriate data presentation tool to present your data. You can use any appropriate tool
in Pearson Statcrunch or Microsoft Excel among others.
Describe your data based on the presentation (skewness, etc.)
Mathematical Skills (For Statistics and Computer Science/Information Technology Students):
(b.) Measures of Center: Calculate the mean, median, mode, and midrange of the data.
Interpret your results with reference to the dataset.
(c.) Measures of Spread: Calculate the range, variance, and standard deviation of the data.
Interpret your results with reference to the dataset.
(d.) Measures of Position: Calculate the five-number summary of the data.
Interpret your results with reference to the dataset.
I shall use my calculators to check your work and your answers.
Please ensure you get the same results as the results on my calculators.
(9.) Programming Skills (For Only Computer Science/Information Technology Students):
(a.) You may use the raw data as an array or an ArrayList or a Vector or as a file as applicable.
Develop a program that computes the:
(b.) Measures of Center: Calculate the mean, median, mode, and midrange of the data.
Interpret your results with reference to the dataset.
(c.) Measures of Spread: Calculate the range, variance, and standard deviation of the data.
Interpret your results with reference to the dataset.
(d.) Measures of Position: Calculate the five-number summary of the data.
Interpret your results with reference to the dataset.
I shall use my calculators to check your work and your answers.
Please ensure you get the same results as the results on my calculators.
(e.) Write comments accordingly.
(f.) Upload all project files (the entire project folder) in the appropriate area in the Blackboard gradebook.
Please NOTE:
For Beginning C++, Beginning VB.NET, Beginning C#, Beginning Java, JavaScript, ASP.NET:
you may use Functional Programming and/or Object-oriented Programming.
For Advanced C++, Advanced VB.NET, Advanced C#, Advanced Java: Object-oriented Programming is required.
(g.) Submit a Reflection Journal. Include your challenges, and how you overcame those challenges.
Please review the rubric for the criteria to be assessed.
(10.) All work must be turned in by the final due date to receive credit.
Any work beyond the final due date will not be accepted.
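For the Programming Skills in (9.), the sketch below shows one possible way to compute the required measures in Python. It is only an illustration, not the required solution: the names `describe` and `median_of_sorted` and the ten sample values are hypothetical. It assumes the sample variance (dividing by n - 1) and takes the quartiles as the medians of the lower and upper halves of the sorted data. Calculators and software differ on the quartile convention, so please confirm which one applies before comparing results.

```python
from collections import Counter
from math import sqrt


def median_of_sorted(values):
    """Return the median of a list that is already sorted."""
    n = len(values)
    mid = n // 2
    if n % 2 == 1:
        return values[mid]
    return (values[mid - 1] + values[mid]) / 2


def describe(data):
    """Compute measures of center, spread, and position for a list of numbers."""
    values = sorted(data)
    n = len(values)
    minimum, maximum = values[0], values[-1]
    mean = sum(values) / n
    median = median_of_sorted(values)

    # Mode: every value tied for the highest frequency; empty if no value repeats.
    counts = Counter(values)
    highest = max(counts.values())
    modes = [v for v, c in counts.items() if c == highest] if highest > 1 else []

    # Assumption: sample variance (divide by n - 1), not population variance.
    variance = sum((x - mean) ** 2 for x in values) / (n - 1)

    # Assumption: quartiles are the medians of the lower and upper halves
    # (the middle value is left out of both halves when n is odd).
    q1 = median_of_sorted(values[: n // 2])
    q3 = median_of_sorted(values[(n + 1) // 2:])

    return {
        "mean": mean,
        "median": median,
        "mode": modes,
        "midrange": (minimum + maximum) / 2,
        "range": maximum - minimum,
        "variance": variance,
        "standard_deviation": sqrt(variance),
        "five_number_summary": (minimum, q1, median, q3, maximum),
    }


# Hypothetical sample values, used only to exercise the functions.
sample = [2, 4, 4, 5, 7, 9, 10, 12, 13, 14]
summary = describe(sample)
for name, value in summary.items():
    print(f"{name}: {value}")
```

The same structure carries over to an array, an ArrayList, or a Vector in the other course languages: sort once, then read every measure from the sorted values.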
Name: | Your name |
Date: | The date |
Instructor: | Samuel Chukwuemeka |
Project: | Descriptive Statistics of the United States Manufactured Housing Shipments $2019$ Data for the $50$ States |
Company/Government: | Census.gov (https://www2.census.gov/programs-surveys/mhs/visualizations/2019/2019usmapbystate.pdf?#) |
Objectives: | (1.) Present a dataset using a histogram. (2.) Describe the dataset. (3.) Determine the measures of central tendency of a dataset. (4.) Determine the measures of variation of a dataset. (5.) Determine the measures of location of a dataset. |
Variable/Type: | Manufactured Housing Shipments / Discrete variable |
Year Obtained: | $2019$ |
Citation: | Indicate the type of citation format. Cite your source accordingly. |
Please Note:
This is part of the Reflection (not the entire Reflection) of the Final Project.
Please review the Final Project Reflection samples provided for you in your course.
The teacher should guide each student to the successful completion of the project.
Let students know you are willing to help.
$1331$ | $1566$ | $3890$ | $553$ | $810$ | $314$ | $143$ | $342$ | $2402$ | $885$ |
$1406$ | $298$ | $238$ | $261$ | $860$ | $1981$ | $15866$ | $847$ | $581$ | $1291$ |
$1565$ | $4360$ | $657$ | $1313$ | $4203$ | $2180$ | $2792$ | $2716$ | $3478$ | $4546$ |
$1828$ | $1074$ | $1101$ | $4871$ | $4079$ | $3649$ | $7819$ | $1862$ | $1610$ | $144$ |
$394$ | $635$ | $190$ | $26$ | $100$ | $596$ | $345$ | $128$ | $89$ | $14$ |
Because we have to find the median and the five-number summary, it is better to sort the data first.
Because the table has five rows and ten columns, I prefer to sort down the columns (each column holds the smaller number of values: $5 \lt 10$), because that is how I shall read the data.
That is my preference. Please do what you prefer.
The sorted data is:
$14$ | $143$ | $298$ | $553$ | $810$ | $1101$ | $1565$ | $1981$ | $3478$ | $4360$ |
$26$ | $144$ | $314$ | $581$ | $847$ | $1291$ | $1566$ | $2180$ | $3649$ | $4546$ |
$89$ | $190$ | $342$ | $596$ | $860$ | $1313$ | $1610$ | $2402$ | $3890$ | $4871$ |
$100$ | $238$ | $345$ | $635$ | $885$ | $1331$ | $1828$ | $2716$ | $4079$ | $7819$ |
$128$ | $261$ | $394$ | $657$ | $1074$ | $1406$ | $1862$ | $2792$ | $4203$ | $15866$ |
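The sorting step can also be checked with a short program. The Python sketch below is an illustration only: it sorts the $50$ raw values from the table above and lays them out down the columns in five rows, the same arrangement as the sorted table. The variable names (`raw`, `ordered`, `table`, `ROWS`) are hypothetical.

```python
from statistics import median

# Raw 2019 manufactured housing shipments, copied from the raw data table.
raw = [
    1331, 1566, 3890, 553, 810, 314, 143, 342, 2402, 885,
    1406, 298, 238, 261, 860, 1981, 15866, 847, 581, 1291,
    1565, 4360, 657, 1313, 4203, 2180, 2792, 2716, 3478, 4546,
    1828, 1074, 1101, 4871, 4079, 3649, 7819, 1862, 1610, 144,
    394, 635, 190, 26, 100, 596, 345, 128, 89, 14,
]

ROWS = 5  # number of rows in the displayed table

# Sort in ascending order.
ordered = sorted(raw)

# Fill down the columns: the value at sorted position i goes to row i % ROWS,
# so row r holds positions r, r + ROWS, r + 2 * ROWS, and so on.
table = [ordered[r::ROWS] for r in range(ROWS)]

for row in table:
    print(" | ".join(str(value) for value in row))

# With 50 values, the median is the mean of the 25th and 26th sorted values.
middle = median(ordered)
print("Median:", middle)
```

Reading the printed table down each column, from left to right, gives the data in ascending order.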