Crowdfunding platforms like Kickstarter and Indiegogo have been growing in success and popularity since the late 2000s. From independent content creators to famous celebrities, more and more people are using crowdfunding to launch new products and generate buzz, but not every project has found success. To receive funding, the project must meet or exceed an initial goal, so many organizations dedicate considerable resources looking through old projects in an attempt to discover “the trick” to finding success. need to organize and analyze a database of 1,000 sample projects to uncover any hidden trends.
-
Used the Excel workbook, modified and analyzed the sample-project data and tried to uncover market trends.
-
Used
**conditional formatting**
to fill each cell in theoutcome
column with a different color, depending on whether the associated campaign was successful, failed, canceled, or is currently live. -
Created a new column called Percent Funded that uses a formula to find how much money a campaign made relative to its initial funding goal.
-
Used
conditional formatting
to fill each cell in the Percent Funded column according to a three-color scale. The scale should start at 0 with a dark shade of red, and it should transition to green at 100 and blue at 200. -
Created a new column called
Average Donation
that uses a formula to find how much each project backer paid on average. -
Created two new columns, one called
Parent Category
and another calledSub-Category
, that use formulas tosplit
the Category and Sub-Category column into the two new, separate columns.
- Created a new sheet with a
pivot table
that analyzes initial worksheet to count how many campaigns were successful, failed, canceled, or are currently live per category. - Created a
stacked-column pivot chart
that can be filtered by country based on the table which is created.
- Created a new sheet with a pivot table that analyzes your initial sheet to count how many campaigns were successful, failed, or canceled, or are currently live per sub-category.
- Created a
stacked-column pivot chart
that can be filtered by country and parent category based on the table which is created. - The dates in the deadline and launched_at columns use Unix timestamps. Fortunately for us, this formulaLinks to an external site. that can be used to convert these timestamps to a normal date.
- Created a new column named
Date Created
Conversion that will use this formulaLinks to an external site. to convert the data contained in launched_at into Excel's date format. - Created a new column named
Date Ended Conversion
that will use this formulaLinks to an external site. to convert the data contained in deadline into Excel's date format. - Created a new sheet with a
pivot table
that has a column ofoutcome
, rows of Date Created Conversion, values based on the count of outcome, andfilters
based on parent category and Years. - Now, created a
pivot-chart line graph
that visualizes this new table. - Created a report in Microsoft Word, and for answering the required questions:(PDF file is included)
-
For Crowdfunding Goal Analysis, Created a new sheet with 8 columns with required column and row headers.
- Used the
COUNTIFS()
formula, for counting how many successful, failed, and canceled projects were created with goals within the ranges listed as per table sheet created. Populate the Number Successful, Number Failed, and Number Canceled columns with these data points. - Added each of the values in the Number Successful, Number Failed, and Number Canceled columns to populate the Total Projects column. Then, using a
mathematical formula
, found out the 19. percentage of projects that were successful, failed, or canceled per goal range. - Created a
line chart
that graphs the relationship between a goal amount and its chances of success, failure, or cancellation.
21. For Statistical Analysis
, Most people would use the number of campaign backers to assess the success of a crowdfunding campaign. Creating a summary statistics table
is one of the most efficient ways that data scientists can characterize quantitative metrics, such as the number of campaign backers.
22. For gaining an in-depth understanding of campaign backers, evaluated the number of backers of successful and unsuccessful campaigns by creating your own summary statistics
table.
23. Created a new worksheet in workbook, and created one column for the number of backers of successful campaigns and one column for unsuccessful campaigns.
24. Used Excel
to evaluate the following values for successful campaigns, and then do the same for unsuccessful campaigns:
The mean
number of backers, The median
number of backers, The minimum number
of backers, The maximum number
of backers, The variance
of the number of backers, The standard deviation
of the number of backers
25. Used data to determine whether the mean
or the median
better summarizes the data.
26. Used data to determine if there is more variability with successful or unsuccessful campaigns. Does this make sense? Why or why not?
- Given the provided data, what are three conclusions that we can draw about crowdfunding campaigns?
- What are some limitations of this dataset?
- What are some other possible tables and/or graphs that we could create, and what additional value would they provide?
Before creating a report using the data that was provided, let's examine the information and make some deductions:
-
Campaigns That Are Successful: Most of the dataset's crowdsourcing initiatives are profitable. In a variety of areas, including Theater, Music, and Film & Video, successful campaigns outweigh canceled, unsuccessful, or in-progress ones. This implies that raising money for different initiatives through crowdfunding can be successful.
-
Campaign Success Rate by Category: Some categories have higher campaign success rates than others. For instance, compared to journalism, which has very few effective campaigns, theater has many. This suggests that depending on the project's categorization, crowdfunding campaigns may or may not be successful.
-
Temporal Trends: The data reveals variations in the monthly total number of campaigns. Crowdfunding campaigns, for example, peak in the summer (June, July) and decline towards the end of the year (November, December). This implies that a crowdfunding campaign's success could be impacted by when it is launched.
-
Absence of Context: The dataset does not provide campaign details, financing targets, or campaign duration, although it does include counts of outcomes for various categories and months. Without this knowledge, it is difficult to comprehend the variables affecting the success or failure of a campaign.
-
Missing Data: The analysis as a whole may be impacted by the absence of information for a few categories or months. Results for several parent categories, such "audio" and "world music," are absent from the dataset, for example.
-
Aggregate Data: You can't undertake in-depth research or pinpoint specific trends within subcategories since the data is aggregated at a high level (e.g., counts of outcomes by category or month).
-
Success Rate by Category: Make a table or bar graph that displays the proportion of campaigns that are successful for each category. This would reveal which categories have the highest rates of success.
-
Trend Analysis Over Time: For each outcome category, create line graphs that display the number of campaigns over time (e.g., by month or year). This would make temporal trends easier to see and patterns in campaign activity easier to spot.
-
Geographical Analysis: Examine the distribution of results by nation or region and, if possible, incorporate data on the geographic locations of campaigns. This may shed light on geographical variations in the success of crowdsourcing.
-
Analyze the relationship between the length of the campaign and its result (success, failure, etc.). Make a box plot or histogram that displays the distribution of campaign durations for campaigns that were successful and unsuccessful.