Udemy Data Analytics ProjectI carried out data analysis on Udemy courses from different subjects to understand where opportunities to increase revenue may lie, and track performance of courses. Project overview & Business problem: I am working as a data analyst for the education tech company Udemy. I have been asked by my manager, Head of Curriculum at Udemy, to present the data on course revenue, and I have been provided with data on courses from different topics to understand where opportunities to increase revenue may lie, and track the performance of courses. My manager has suggested encouraging Web Development courses to charge more because she believes that these are the most popular courses. She needs to send a report to the CEO in the next three weeks with recommendations on how we can increase our next quarterly earnings. You can get the datasets using this link Tools used to carry out this project include:
Data Cleaning Process: Data cleaning is the process of preparing data for analysis by removing or modifying data that is incorrect, incomplete, irrelevant, duplicated, or improperly formatted. Data cleaning plays an important role in the analytical process and making sure that the answers we uncover are reliable and of a high quality. Below are the data cleaning processes I carried out using the power query editor in Microsoft Office Excel:
2. I changed the data type for each column to their correct data formats. Made Course Title column a text (string) data type.3. I added a conditional column which tells whether a course is “Free or Paid”. I used the data in the price column to populate this column. If price = 0 output will be “Free”, else “Paid”Result of the conditional column.4. I deleted irrelevant columns that would not be useful for my analysis. 5. I ensured that I had clear and concise names for headers and used dashes or underscores in between words to make it easier to parse later on. 6. I removed duplicate rows of data. 7. I removed blank rows. Tasks carried out on Microsoft SQL Server. After carrying out the above data cleaning processes on Microsoft Excel, I imported the cleaned datasets into Microsoft SQL Server Management Studio to write queries that would aid in generating key insights in the data. Asides writing queries to generate insights into the data, I also wrote queries to append the datasets using the UNION operator to make it a whole dataset, and I wrote queries to create a new column called “Revenue”. Revenue calculated as Price * Number of Subscribers. You can hit this link which will direct you to my GitHub where you can view my SQL Codes. Analysis and Insights. Note: In Microsoft Power BI, I wrote DAX formulas to create a calendar table, the calendar table contained the date, year, month, month number (to sort the month in order), and the quarters (Q1, Q2, Q3, and Q4). I also wrote DAX formulas to create Measures that I used to generate the visuals.
Power BI Visualization: Recommendations: To increase our next quarterly earnings we need to:
|