The Smith College Museum of Art (SCMA) staff want to better understand how people use the museum website. Project 1 will focus on real SCMA data spanning from July 2021 to June 2022. Your goal is to create a portfolio of data visualizations which help communicate the usage patterns of the SCMA website to museum staff. You can think of this project as a formalized exploratory data analysis. At the end of Project 1, your visualizations and accompanying reports will be given to the museum staff for them to learn from!
Click here for the Github Classroom Assignment for Project 1.
Project 1 will be completed in self-organized teams of 3 to 4. Each member will be creating their own visualizations, but each visualization from the team will be combined into one portfolio. You should not expect the museum staff to be familiar with data, and should explain things in such a way that anyone could understand.
All of the data (aside from the shop data) are exported from the museum’s Google analytics platform. You can find the documentation for Google analytics here; you will need to use it to reference what some of the measures in the data mean. You do not need to use every data file; just use what is relevant to your exploration. You can choose to supplement the museum data with other data if you would like. The museum data files include the following:
You can find the data files hosted on the course Moodle here. You should unzip the file, and place all of files contained inside the data/
folder within this project. You should not share these data files anywhere, including your project Github repo.
You have 2 weeks (until start of class 10/28) to work on this project, including one full day of class time (10/14). You have full freedom to make the project as simple or intricate as you desire. Each member of the team must make at least one data visualization. You can also create tables or other summaries to help explain the data to the reader. Your final submission should include the following:
In your team Github repo:
Through Moodle (Turn in here):
The team portfolio should be created using Quarto in R studio. I recommend you output to a PDF if you have only static visualizations, or html if your visualizations are interactive; you can choose other formats if they work better for your project. You will commit your rendered report in the docs/
directory of this project.
Each team will share one github repo. This repo should include all of your code. I recommend creating your visualizations in separate files, and then combining them in your Quarto portfolio document as a separate step.