{"id":6150,"date":"2019-07-26T10:51:29","date_gmt":"2019-07-26T02:51:29","guid":{"rendered":"http:\/\/www.finereport.com\/en\/?p=6150"},"modified":"2019-07-26T10:51:31","modified_gmt":"2019-07-26T02:51:31","slug":"data-analysis-practice-guide-how-to-begin","status":"publish","type":"post","link":"https:\/\/frg.fineres.com\/en\/2019\/07\/26\/data-analysis-practice-guide-how-to-begin\/","title":{"rendered":"Data Analysis Practice Guide\u2014\u2014How to begin"},"content":{"rendered":"\n<p>Many beginners are confused about how to\nlearn data analysis. Today, I will introduce the whole process of data analysis\nto answer your doubts and open up new ideas.<\/p>\n\n\n\n<p>Now, you already know the importance of data analysis in modern society. Mastering the data means mastering the law. When you understand the market data and analyze it, you can get the market rules. When you master the data of the product itself, analyze it, you can understand the user source of the product, user portraits and so on. Data analysis is so important, it is not only the &#8220;data structure + algorithm&#8221; of the new era, but also the high ground for enterprises to compete for talents.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1000\" height=\"667\" src=\"http:\/\/www.finereport.com\/en\/wp-content\/uploads\/2019\/07\/2019072601L.png\" alt=\"\" class=\"wp-image-6151\" srcset=\"https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072601L.png 1000w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072601L-300x200.png 300w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072601L-768x512.png 768w\" sizes=\"(max-width: 1000px) 100vw, 1000px\" \/><\/figure>\n\n\n\n<h3><strong>What is the process of data analysis?<\/strong><\/h3>\n\n\n\n<p>Data analysis is mainly divided into three\nsteps<\/p>\n\n\n\n<p>1. Data Collection<\/p>\n\n\n\n<p>That is to take raw materials, we can&#8217;t\nanalyze without data.<\/p>\n\n\n\n<p>2. Data Mining<\/p>\n\n\n\n<p>Data mining is the value of the entire\nbusiness. The core of data mining is to mine the commercial value of data,\nwhich is what we call business intelligence.<\/p>\n\n\n\n<p>3. Data Visualization<\/p>\n\n\n\n<p>Simply put, let us intuitively understand the results of data analysis.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1024\" height=\"533\" src=\"http:\/\/www.finereport.com\/en\/wp-content\/uploads\/2019\/07\/2019072602L-1024x533.png\" alt=\"\" class=\"wp-image-6152\" srcset=\"https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072602L-1024x533.png 1024w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072602L-300x156.png 300w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072602L-768x400.png 768w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072602L.png 1518w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Talking like this may be too simple, let me introduce you to these three steps in detail.<\/p>\n\n\n\n<h4><strong>Data collection<\/strong><\/h4>\n\n\n\n<p>In the data collection section, you usually\nwork with different data sources and then use tools to collect them.<\/p>\n\n\n\n<p>On the web you can collect a wide variety of data sets. There are also many tools that can help you automatically scrape data. Of course, if you write a Python crawler, it will be even more efficient. The fun of mastering Python crawlers is endless. It not only allows you to get hot reviews on social media, automatically downloads posters with keywords, but also automatically adds fans to your account, giving you the thrill of automation.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1024\" height=\"567\" src=\"http:\/\/www.finereport.com\/en\/wp-content\/uploads\/2019\/07\/2019072603L-1-1024x567.png\" alt=\"\" class=\"wp-image-6155\" srcset=\"https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072603L-1-1024x567.png 1024w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072603L-1-300x166.png 300w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072603L-1-768x425.png 768w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072603L-1.png 1608w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h4><strong>Data mining<\/strong><\/h4>\n\n\n\n<p>The second part is data mining, which can\nbe compared to the &#8220;algorithm&#8221; part of the entire data analysis\nprocess.<\/p>\n\n\n\n<p>First you need to know its basic flow, the\ntop ten algorithms, and the mathematical foundation behind it.<\/p>\n\n\n\n<p>In this part, we will come into contact\nwith some concepts, such as association analysis, Adaboost algorithm, etc. You\nmay just have a little knowledge of these concepts. It doesn&#8217;t matter. I will\nintroduce this knowledge to you in detail later.<\/p>\n\n\n\n<p>Mastering data mining is like holding a\ncrystal ball. It uses historical data to tell you what will happen in the\nfuture.<\/p>\n\n\n\n<p>Of course it will also tell you how confident this is. I will also explain the definition of confidence in a later article.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1024\" height=\"592\" src=\"http:\/\/www.finereport.com\/en\/wp-content\/uploads\/2019\/07\/2019072604L-1-1024x592.png\" alt=\"\" class=\"wp-image-6156\" srcset=\"https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072604L-1-1024x592.png 1024w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072604L-1-300x173.png 300w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072604L-1-768x444.png 768w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072604L-1.png 1744w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h4><strong>Data visualization<\/strong><\/h4>\n\n\n\n<p>The third is data visualization, which is a\nvery important step that we are particularly interested in. Data is often\nimplicit, especially when data is large, and visualization is a good way to\nunderstand the structure of the data and the presentation of the results. How\nto visualize data? There are two ways.<\/p>\n\n\n\n<p>The first is to use Python. In the process\nof cleaning and mining data in Python, we can use third-party libraries such as\nMatplotlib and Seaborn to render.<\/p>\n\n\n\n<p>The second is to use third-party tools. If you have already generated a csv format file and want to use WYSIWYG to render it, you can use third-party tools such as Data GIF Maker, Tableau, <a href=\"http:\/\/www.finereport.com\/en\/\">FineReport<\/a>, etc., which can easily process the data and help you make the presentation. <\/p>\n\n\n\n<p>The principles of data collection and data visualization are simple and easy to understand. These two parts focus on the mastery of the tools, so I will focus on introducing the use of tools.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1024\" height=\"479\" src=\"http:\/\/www.finereport.com\/en\/wp-content\/uploads\/2019\/07\/2019072605L-1024x479.png\" alt=\"\" class=\"wp-image-6157\" srcset=\"https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072605L-1024x479.png 1024w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072605L-300x140.png 300w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072605L-768x359.png 768w, https:\/\/frg.fineres.com\/en\/wp-content\/uploads\/2019\/07\/2019072605L.png 1138w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Of course, these theories are relatively abstract, so I think the best way to learn data analysis is to use them in tools and deepen understanding in projects.<\/p>\n\n\n\n<h3><strong>Practice guide<\/strong><\/h3>\n\n\n\n<p>Just now we talked about the data analysis\npanorama, including data acquisition, data mining, and data visualization. You\nmay feel that there are a lot of things, you can&#8217;t start, or you feel that data\nmining involves many algorithms, and some are difficult to master. In fact,\nthese are unnecessary troubles.<\/p>\n\n\n\n<p>Here we introduce the MAS (Multi-dimension,\nAsk, Share) learning method. With this method, learning data analysis is a\nprocess from &#8220;thinking&#8221; to &#8220;tool&#8221; to &#8220;practice&#8221;.\nToday I will share my learning experience with you from more angles. We can\ncall today&#8217;s content a &#8220;practice guide.&#8221;<\/p>\n\n\n\n<p><strong>We turn knowledge into our own language,\nand it really becomes our own thing.<\/strong> The process of\nthis transformation is the process of cognition.<\/p>\n\n\n\n<p>So how to improve your ability of learning?\nSimply put, it is to &#8220;know and do.&#8221;<\/p>\n\n\n\n<p>If cognition is the brain, tools are like\nour hands, and data engineers and algorithm scientists deal with the tools\nevery day.<\/p>\n\n\n\n<p>If you start to do data analysis projects, have already thought about the algorithm model of data mining in your mind, please keep in mind the following two principles.<\/p>\n\n\n\n<h4><strong>1.Do not repeat producing wheels<\/strong><\/h4>\n\n\n\n<p>As an example of data collection, I have\nseen many companies that have data collection needs. They think that some tools\ncan&#8217;t meet their individual needs, so they decided to recruit people to do this\nwork. What happened? After more than a year of practice, the wages invested\nhundreds of thousands, found a lot of bugs, and finally chose third-party\ntools. At this time, in fact, with timely assessment of need, and cooperation\nwith FineReport, you can save losses in a timely manner.<\/p>\n\n\n\n<h4><strong>2. Tools determine efficiency<\/strong><\/h4>\n\n\n\n<p>\u201cDon&#8217;t repeat producing wheels\u201d means you\nfirst need to find a wheel that can be used, which is a tool. How do we choose?<\/p>\n\n\n\n<p>It depends on the work you are going to do.\nThe tools are not good or bad, only suitable or not. In addition to\nresearch-type work, in most cases, engineers will choose the most user-friendly\ntools. Because: Bug is , documents are complete, and there are many cases.<\/p>\n\n\n\n<p>For example, Python has a lot of\nthird-party libraries for handling data mining. These libraries have a large\nnumber of users and help files to help you get started.<\/p>\n\n\n\n<p>In the following lessons, I will introduce\nyou to the most commonly used tools that will make your data mining more\neffective.<\/p>\n\n\n\n<p>After choosing a good tool, all you have to\ndo is accumulate \u201cassets\u201d. It&#8217;s hard to remember a lot of knowledge points, and\nwe can&#8217;t follow the instructions of the tools, but we can usually remember the\nstories, the projects we have done, and the problems we have done. These topics\nand projects are your first &#8220;assets&#8221;.<\/p>\n\n\n\n<p>How to quickly accumulate these\n&#8220;assets&#8221;? Here I send you a word: proficiency. Solving the problems\nis only the first step. The key is to train the \u201cproficiency\u201d used by our\ntools.<\/p>\n\n\n\n<p>As proficiency increases, your thinking cognitive model is gradually improving, and efficiency will naturally increase.<\/p>\n\n\n\n<h3><strong>Conclusion<\/strong><\/h3>\n\n\n\n<p>Cognitive trilogy, from cognition to tools\nto actual combat, is the learning advice I most want to share with you. After\nreading this article, be sure to start practicing!<\/p>\n\n\n\n<p><em>Finally, to learn about the next tutorial, welcome to follow <\/em><a href=\"https:\/\/www.facebook.com\/finereport\/\"><em>FineReport Reporting Software.<\/em><\/a><\/p>\n\n\n\n<h2>You might also be interested in\u2026<\/h2>\n\n\n\n<p><a href=\"http:\/\/www.finereport.com\/en\/about-finereport\/how-can-beginners-design-cool-data-visualizations.html\">How Can Beginners Design Cool Data Visualizations?<\/a><\/p>\n\n\n\n<p><a href=\"http:\/\/www.finereport.com\/en\/data-visualization\/a-beginners-guide-to-business-dashboards.html\">A Beginner\u2019s Guide to Business Dashboards<\/a><\/p>\n\n\n\n<p><a href=\"http:\/\/www.finereport.com\/en\/product-functions\/pure-report-operated-heat-map-tutorial-without-one-line-of-code.html\">Pure report-operated heat map tutorial without one line of code!<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>An article to tell you how to begin learning data analysis!<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[161],"tags":[151],"_links":{"self":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts\/6150"}],"collection":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/comments?post=6150"}],"version-history":[{"count":1,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts\/6150\/revisions"}],"predecessor-version":[{"id":6159,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts\/6150\/revisions\/6159"}],"wp:attachment":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/media?parent=6150"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/categories?post=6150"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/tags?post=6150"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}