5 answers
Asked
825 views
Do I really need to know statistics for data mining ?
I'm taking classes at online university that I work at. I hate the statistic formulas and the different verbiage I have to know, I feel like it doesn't match with entry level job in data mining.
5 answers
Updated
Olawale’s Answer
Statistics is important for data mining because it helps you understand and analyze data effectively. Techniques like descriptive statistics (mean, median, variance) allow you to summarize data, while statistical tests like correlation and chi-square help uncover relationships between variables. Additionally, many machine learning algorithms used in data mining, such as regression and probabilistic models, are rooted in statistical principles, making a basic understanding of statistics essential.
You’ll need statistics for tasks like exploratory data analysis, feature selection, and model evaluation. For example, you can use statistical metrics to assess a model’s accuracy or significance, ensuring your findings are reliable. However, if you’re working with automated tools like Tableau, Power BI, or pre-built machine learning models, the tools often handle the statistical calculations, reducing the need for in-depth knowledge.
While you don’t need to be an expert, having a solid foundation in concepts like probability distributions, hypothesis testing, and regression is highly beneficial. These skills help you work more effectively, especially for advanced analytics or predictive modeling. If you’re unfamiliar with these concepts, starting with beginner-friendly resources can provide the understanding you need for data mining.
You’ll need statistics for tasks like exploratory data analysis, feature selection, and model evaluation. For example, you can use statistical metrics to assess a model’s accuracy or significance, ensuring your findings are reliable. However, if you’re working with automated tools like Tableau, Power BI, or pre-built machine learning models, the tools often handle the statistical calculations, reducing the need for in-depth knowledge.
While you don’t need to be an expert, having a solid foundation in concepts like probability distributions, hypothesis testing, and regression is highly beneficial. These skills help you work more effectively, especially for advanced analytics or predictive modeling. If you’re unfamiliar with these concepts, starting with beginner-friendly resources can provide the understanding you need for data mining.
Updated
Tejas’s Answer
Absolutely, understanding statistics is essential for data mining. Statistics provides the foundation for making sense of data, recognizing patterns, and building models. Many data mining techniques, such as clustering, regression, and classification, rely on statistical principles. By having a strong grasp of statistics, you can make better decisions about which methods to apply and how to interpret the results.
Here are a few great resources to help you strengthen your understanding of statistics:
StatQuest (YouTube Channel)
This channel breaks down complex statistical concepts into easy-to-understand videos. It's perfect for beginners and even intermediate learners who want to revisit topics in a clear, concise way.
3Blue1Brown (YouTube Channel)
Known for its visually intuitive explanations, 3Blue1Brown covers a variety of mathematical topics, including statistics, probability, and linear algebra—all of which are important for data mining. The visuals help make difficult topics easier to grasp.
Khan Academy
Another excellent resource for learning statistics, Khan Academy offers a structured, free course with practice problems to help reinforce concepts.
Here are a few great resources to help you strengthen your understanding of statistics:
StatQuest (YouTube Channel)
This channel breaks down complex statistical concepts into easy-to-understand videos. It's perfect for beginners and even intermediate learners who want to revisit topics in a clear, concise way.
3Blue1Brown (YouTube Channel)
Known for its visually intuitive explanations, 3Blue1Brown covers a variety of mathematical topics, including statistics, probability, and linear algebra—all of which are important for data mining. The visuals help make difficult topics easier to grasp.
Khan Academy
Another excellent resource for learning statistics, Khan Academy offers a structured, free course with practice problems to help reinforce concepts.
Updated
Alex’s Answer
Yes, you will want to have a good basis in statistics. I am afraid it is very important. The good news is - the computers and tools will do a lot of this for you when you are in the workforce .... but it will be important for you to know the basics to build your career. And - it will help more than you realize! Stock picking, sports betting, election tracking - here's a cool book recommendation -- On the Edge: The Art of Risking Everything
by Nate Silver. He has turned an understanding of stats into a VERY famous career!
by Nate Silver. He has turned an understanding of stats into a VERY famous career!
Updated
Randy’s Answer
Hello Merika,
It's fantastic to connect with you, and I'm thrilled to see your interest in Data Mining. Let's dive into a few key insights. I firmly believe that each individual should pursue what truly sparks their enthusiasm and brings them joy. This passion naturally leads you to excel in that particular field.
Data Mining and statistics go hand in hand. It involves the process of evaluating, scaling, quantifying, and identifying patterns. Even though you might be using software packages, it's essential to apply statistical principles and formulas.
My specialty lies in comprehending the business aspect of this equation, with a focus on practical application. Whether it's solving problems or optimizing for business value, that's where I thrive.
I hope this gives you some direction in carving out your own path. Wishing you all the best on this exciting journey!
It's fantastic to connect with you, and I'm thrilled to see your interest in Data Mining. Let's dive into a few key insights. I firmly believe that each individual should pursue what truly sparks their enthusiasm and brings them joy. This passion naturally leads you to excel in that particular field.
Data Mining and statistics go hand in hand. It involves the process of evaluating, scaling, quantifying, and identifying patterns. Even though you might be using software packages, it's essential to apply statistical principles and formulas.
My specialty lies in comprehending the business aspect of this equation, with a focus on practical application. Whether it's solving problems or optimizing for business value, that's where I thrive.
I hope this gives you some direction in carving out your own path. Wishing you all the best on this exciting journey!
Updated
Jeff’s Answer
Stats was my least favorite class but I do think it will be useful. While you may not have to do the calculations like you do in school, having that experience will get you familiar with terms, understanding why things are trending a certain way, etc. Wish I could recommend dropping it!
Delete Comment
Flag Comment