From bb956bb16d85e5012a5443230dc4d8c31333825f Mon Sep 17 00:00:00 2001 From: Carolyn Russell <111674383+Kenner82@users.noreply.github.com> Date: Tue, 17 Jan 2023 22:16:56 -0800 Subject: [PATCH 1/2] Update README.md --- README.md | 14 ++++++-------- 1 file changed, 6 insertions(+), 8 deletions(-) diff --git a/README.md b/README.md index f0dd63f..50c7c99 100644 --- a/README.md +++ b/README.md @@ -1,11 +1,9 @@ -# Kickstarting with Excel +# Analyzing Kickstarter Data With Excel -## Overview of Project - -### Purpose +## Purpose Evaluate the outcomes of Kickstarter campaigns based on launch date and financing goals to determine ideal conditions for launching a successful compaign. -## Analysis and Challenges +## Analysis ### Analysis of Outcomes Based on Launch Date Using information provided on the details of 4,114 Kickstarter compaigns launched between 2009-2017, data was first analyzed using a pivot table to determine the number of successful, failed, and canceled campaigns based on the months in which they were launched. It was further filtered by Parent Category to view only results labeled "theater". The option to filter by Year was also included, though not used in this analysis. @@ -23,7 +21,7 @@ This data was also visualized with a line graph. ![Outcomes_vs_Goals](https://user-images.githubusercontent.com/111674383/190318009-dc77cd18-885c-4f18-ae5c-7cee27570d17.png) -### Challenges and Difficulties Encountered +## Challenges And Difficulties In order to analyze the original dataset, additional columns had to be made to convert column data into usable category and date formats. The first formulas written using the CountIfs function used relative instead of absolute cell references. When the formulas were subsequently copied into other rows and columns, the columns referenced in the Kickstarter dataset automatically shifted and returned inaccurate calculations. A Testing sheet was added to the workbook, and within it a pivot table was used to count the number of each campaign outcome at individual funding amounts. These amounts were then grouped into the ranges used in the original calculations to check the accuracy of the data. This highlighted the error within the original formula, and further formulas used absolute cell references appropriately. @@ -40,11 +38,11 @@ Also within the CountIfs formulas, the data ranges are hard-coded into the calcu * For a successful campaign within the Plays subcategory, the ideal amount of money to ask for is less than $1,000. Of all the data ranges, this one has the highest percentage rate for success and the lowest for failure. * The next best option for a succesful Plays campaign is a goal amount from $1,000-$4,999. The percentages of success and failure at this level are very close to the lowest data range, but the number of campaigns launched is much greater (534 projects compared to 186). That provides a more robust data set to calculate with, particularly since the next highest rates of success have fewer than 10 total projects in that data range. -### Dataset Limitations +## Limitations * Both graphs differ on how they are filtered (one is by a Parent category, the other a Subcategory). To best compare the ideal conditions for a successful campaign, it would help to keep that variable consistent. * There are very few campaigns within the Plays subcategory in the higher goal amounts, which means that in some specific data ranges there are fewer than 10 total projects with which to calculate the percentages (a 67% success rate sounds pretty good until you realize there were only 3 total projects in that range). It would help to have a dataset that included information from a longer time span to confirm the accuracy of the percentages which are calculated using a small number of total projects. -### Additional Tables and/or Graphs +## Additional Analysis * It would be useful to have the same sets of graphs for each variable being evaluated. * That would include using a pivot table to calculate the raw number of successful, failed, and canceled projects when evaluated by launch date and funding goal. * It would also include calculations to convert the raw data to percentages when evaluating the outcomes by launch date and funding goal. From 1644c25974651780d04e81312cf5f35c6de6651e Mon Sep 17 00:00:00 2001 From: Carolyn Russell <111674383+Kenner82@users.noreply.github.com> Date: Tue, 17 Jan 2023 22:20:09 -0800 Subject: [PATCH 2/2] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 50c7c99..eda02e2 100644 --- a/README.md +++ b/README.md @@ -26,7 +26,7 @@ In order to analyze the original dataset, additional columns had to be made to c The first formulas written using the CountIfs function used relative instead of absolute cell references. When the formulas were subsequently copied into other rows and columns, the columns referenced in the Kickstarter dataset automatically shifted and returned inaccurate calculations. A Testing sheet was added to the workbook, and within it a pivot table was used to count the number of each campaign outcome at individual funding amounts. These amounts were then grouped into the ranges used in the original calculations to check the accuracy of the data. This highlighted the error within the original formula, and further formulas used absolute cell references appropriately. -Also within the CountIfs formulas, the data ranges are hard-coded into the calculations. Adding a column to transform the text data in the Goal column to mathmatical representations (e.g., "<1000" instead of "Less than 1000") would provide greater flexibility as well as fewer opportunities for errors. +Also within the CountIfs formulas, the data ranges are hard-coded into the calculations. In the future, adding a column to transform the text data in the Goal column to mathematical representations (e.g., "<1000" instead of "Less than 1000") would provide greater flexibility as well as fewer opportunities for errors. ## Results