{"id":9004,"date":"2019-12-19T17:36:41","date_gmt":"2019-12-19T09:36:41","guid":{"rendered":"https:\/\/www.finereport.com\/en\/?p=9004"},"modified":"2020-11-20T16:17:32","modified_gmt":"2020-11-20T08:17:32","slug":"how-do-super-rookies-start-learning-data-analysis","status":"publish","type":"post","link":"https:\/\/frg.fineres.com\/en\/2019\/12\/19\/how-do-super-rookies-start-learning-data-analysis\/","title":{"rendered":"How Do Super Rookies Start Learning Data Analysis?"},"content":{"rendered":"<p class=\"graf graf--p\">For super rookies, the first task is to understand what data analysis is.<\/p>\n<p class=\"graf graf--p\">Data analysis is a type of knowledge discovery that gains insights from data and drives business decisions.<\/p>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*D2E0x9qt7V3lTHGTf_9XZg.jpeg\" data-image-id=\"1*D2E0x9qt7V3lTHGTf_9XZg.jpeg\" data-width=\"720\" data-height=\"536\" data-is-featured=\"true\" \/><figcaption class=\"imageCaption\">From Google<\/figcaption><\/figure>\n<p class=\"graf graf--p\">There are two points here. <strong class=\"markup--strong markup--p-strong\">One is how to gain insights from the data. <\/strong>Data is cold and can\u2019t speak. Professional data analysts must have a wealth of business knowledge in order to know from the data what has happened and what is about to happen. In addition, tools for data analysis and data mining are also important. Excel, Python, Power BI, Tableau, FineReport are frequently used by data analysts. However, many beginners often pay too much attention to tools and ignore the professional qualities that a data analyst should have.<\/p>\n<p class=\"graf graf--p\"><strong class=\"markup--strong markup--p-strong\">Another is how to drive business decisions.<\/strong> This may not be the level that ordinary data analysts can decide. But a good data analyst does need to have a keen business vision. Pure data analysis results are not helpful. Combining the analysis results with real scenes to produce instructive conclusions is the value of a data analyst.<\/p>\n<p class=\"graf graf--p\">I know novices are very concerned about the learning process of data analysis. You may be full of doubts and yearnings for SQL, Python, R, etc. This was also my mentality when I first came into contact with data analysis. There are so many things to learn, which one should I learn? How do I learn, and to what extent?<\/p>\n<p class=\"graf graf--p\">Now let me talk about the selection of data analysis tools.<\/p>\n<h2 class=\"graf graf--h3\"><strong class=\"markup--strong markup--h3-strong\">How to Choose Data Analysis\u00a0Tools?<\/strong><\/h2>\n<p class=\"graf graf--p\">In general, if you want to become an excellent data analyst, you should master at least three types of tools: <strong class=\"markup--strong markup--p-strong\">self-service BI tools, SQL, and programming languages<\/strong>. The selection criteria of these three types of tools are different.<\/p>\n<p class=\"graf graf--p\">For super rookies, the priority is to learn self-service tools, to ensure that they can get started with data analysis as soon as possible, and to master the basic knowledge of data analysis. Second, learn SQL and understand the concept of database. Finally, if you want to reach a higher level, you need to learn programming languages and even data analysis libraries. Next, I will introduce the specific selection one by one.<\/p>\n<h2 class=\"graf graf--h3\"><strong class=\"markup--strong markup--h3-strong\">1. Self-service BI\u00a0tools<\/strong><\/h2>\n<p class=\"graf graf--p\">What is a self-service analysis tool? In fact, it\u2019s a BI analysis tool specifically for business people, helping them get rid of the constraints of traditional IT and complete data analysis work independently. For the super rookie, the learning cost and threshold are relatively low, and it is easy to get started.<\/p>\n<p class=\"graf graf--p\">Taking <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/www.finereport.com\/en\/?utm_source=medium&amp;utm_medium=media&amp;utm_campaign=blog&amp;utm_term=How%20Do%20Super%20Rookies%20Start%20Learning%20Data%20Analysis%3F\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/www.finereport.com\/en\/?utm_source=medium&amp;utm_medium=media&amp;utm_campaign=blog&amp;utm_term=How%20Do%20Super%20Rookies%20Start%20Learning%20Data%20Analysis%3F\">FineReport<\/a> as an example, it is a <a href=\"https:\/\/www.finereport.com\/en\/bi-tools\/bi-reporting.html\" target=\"_blank\" rel=\"noopener noreferrer\">BI reporting tool<\/a> that can connect to various data sources, quickly analyze the data, and make various reports and cool <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/www.finereport.com\/en\/features\/tv-dashboard?utm_source=medium&amp;utm_medium=media&amp;utm_campaign=blog&amp;utm_term=How%20Do%20Super%20Rookies%20Start%20Learning%20Data%20Analysis%3F\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/www.finereport.com\/en\/features\/tv-dashboard?utm_source=medium&amp;utm_medium=media&amp;utm_campaign=blog&amp;utm_term=How%20Do%20Super%20Rookies%20Start%20Learning%20Data%20Analysis%3F\">dashboards<\/a>. Its designer interface is similar to Excel. You can complete real-time report through simple drag and drop operations. Its data entry system and support of decision-making platform provide a series of functions of data reporting, process approval, and authority management, which can flexibly respond to business needs such as operations, human resources, finance, and contracts.<\/p>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*U8unLhDxF83POGNzu4Q8xQ.png\" data-image-id=\"1*U8unLhDxF83POGNzu4Q8xQ.png\" data-width=\"2464\" data-height=\"1154\" \/><figcaption class=\"imageCaption\">Application architecture of FineReport<\/figcaption><\/figure>\n<p class=\"graf graf--p\">In fact, <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/www.finereport.com\/en\/?utm_source=medium&amp;utm_medium=media&amp;utm_campaign=blog&amp;utm_term=How%20Do%20Super%20Rookies%20Start%20Learning%20Data%20Analysis%3F\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/www.finereport.com\/en\/?utm_source=medium&amp;utm_medium=media&amp;utm_campaign=blog&amp;utm_term=How%20Do%20Super%20Rookies%20Start%20Learning%20Data%20Analysis%3F\">FineReport<\/a> is like a combined version of Excel and Tableau. It can produce a variety of complex reports. At the same time, it also advocates visual exploratory analysis. It is a bit like an enhanced version of PivotTable. The visualization component library of FineReport is very rich. It can be used as a portal for data reporting, or as a platform for business analysis.<\/p>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*B831mLVvwYLjKi09e60GuQ.gif\" data-image-id=\"1*B831mLVvwYLjKi09e60GuQ.gif\" data-width=\"1254\" data-height=\"607\" \/><figcaption class=\"imageCaption\">Reporting of FineReport<\/figcaption><\/figure>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*Tq19mki3_VOp3Pb3br1j_w.gif\" data-image-id=\"1*Tq19mki3_VOp3Pb3br1j_w.gif\" data-width=\"1845\" data-height=\"870\" \/><figcaption class=\"imageCaption\">Dashboard of FineReport<\/figcaption><\/figure>\n<p class=\"graf graf--p\">For newbies, a tool with low learning difficulty but powerful analytical performance cannot be better. And more importantly, the personal version of FineReport is completely free, which can support individuals to conduct self-service analysis.<\/p>\n<p class=\"graf graf--p\">Of course, other BI tools such as Power BI and Qlikview also have their own advantages. If you want to learn more about self-service BI tools, you can take a look at this review: <a href=\"https:\/\/www.finereport.com\/en\/bi-tools\/top-5-bi-tools-of-2019-comparison-and-how-to-decide.html\"><strong class=\"markup--strong markup--p-strong\"><em class=\"markup--em markup--p-em\">5 Most Popular Business Intelligence (BI) Tools in 2019<\/em><\/strong><\/a>, to understand your own needs and then choose the tool that is right for you.<\/p>\n<h3 class=\"graf graf--h3\"><strong class=\"markup--strong markup--h3-strong\">2. SQL<\/strong><\/h3>\n<p class=\"graf graf--p\">Structured Query Language (SQL) is used to communicate with a database. It is a database query language for accessing data and querying, updating, and managing relational database systems. Common relational database management systems are SQL Server, MySQL, Oracle, MS Access, DB2, etc. Most database systems use SQL. Generally, companies will store data in local databases or public clouds. Some will use MySQL, Oracle, MongoDB, etc., and others will use big data storage format like HBase and Parquet.<\/p>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*jGPAJaUegwnTVLtX3KXs-g.gif\" data-image-id=\"1*jGPAJaUegwnTVLtX3KXs-g.gif\" data-width=\"517\" data-height=\"480\" \/><\/figure>\n<p class=\"graf graf--p\">I will recommend beginners to learn SQL well, and then get to know about HBase and Parquet as needed.<\/p>\n<h3 class=\"graf graf--h3\"><strong class=\"markup--strong markup--h3-strong\">3. Programming Languages<\/strong><\/h3>\n<p class=\"graf graf--p\"><a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/www.python.org\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/www.python.org\/\">Python<\/a> and <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/www.r-project.org\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/www.r-project.org\/\">R<\/a> are the two most widely used programming languages in the field of data analysis. I think both are suitable as the core language of data analysis, but it is better to choose one to learn.<\/p>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*2QNQeyJl79wTqZkuin7efw.png\" data-image-id=\"1*2QNQeyJl79wTqZkuin7efw.png\" data-width=\"1158\" data-height=\"720\" \/><\/figure>\n<p class=\"graf graf--p\">Since many people have asked me questions about Python, and I also work with Python myself, here I will talk about the advantages and disadvantages of using Python for data analysis.<\/p>\n<p class=\"graf graf--p\">As a high-level programming language, the biggest disadvantage of Python is that it is not good at developing underlying applications. But except for that, Python can do almost anything. When it comes to data analysis, from database operations, data cleaning, <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/towardsdatascience.com\/9-data-visualization-tools-that-you-cannot-miss-in-2019-3ff23222a927\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/towardsdatascience.com\/9-data-visualization-tools-that-you-cannot-miss-in-2019-3ff23222a927\">data visualization<\/a>, to machine learning, batch processing, script writing, model optimization, and deep learning, all these functions can be implemented with Python, and different libraries are provided for you to choose.<\/p>\n<figure class=\"graf graf--figure\"><img class=\"graf-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*EuWxCh5Q5lLYI3wrt0rtGg.png\" data-image-id=\"1*EuWxCh5Q5lLYI3wrt0rtGg.png\" data-width=\"1216\" data-height=\"460\" \/><figcaption class=\"imageCaption\">From Google<\/figcaption><\/figure>\n<p class=\"graf graf--p\">In addition, Jupyter Notebook is also an excellent interactive tool for data analysis and provides a convenient experimental platform for beginners.<\/p>\n<h3 class=\"graf graf--h3\"><strong class=\"markup--strong markup--h3-strong\">4. Data Analysis Libraries<\/strong><\/h3>\n<p class=\"graf graf--p\">In addition to the three types of tools mentioned above, there is actually a type of data analysis library that is more suitable for advanced data analysts. If you are still a newbie, you can ignore this section.<\/p>\n<p class=\"graf graf--p\"><a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/pandas.pydata.org\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/pandas.pydata.org\/\">Pandas<\/a> is a Python data science library that is constantly improving. Its data structure is very suitable for data processing. Pandas incorporates a large number of analysis function methods, as well as common statistical models and visualization processing. If you use Python for data analysis, during the data preprocessing process, almost 90% of the work needs to be completed using Pandas.<\/p>\n<p class=\"graf graf--p\"><a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/numpy.org\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/numpy.org\/\">NumPy<\/a> is a numerical calculation library for Python. Many analysis libraries, including Pandas, are built on NumPy.<\/p>\n<p class=\"graf graf--p\">The core features of NumPy include:<\/p>\n<ul class=\"postList\">\n<li class=\"graf graf--li\">Ndarray, a fast and space-saving multidimensional array with vector arithmetic operation capabilities.<\/li>\n<li class=\"graf graf--li\">Standard mathematical functions for fast operations on entire sets of data (no need to write loops).<\/li>\n<li class=\"graf graf--li\">Tool for reading and writing disk data and for manipulating memory-mapped files.<\/li>\n<li class=\"graf graf--li\">Linear algebra, random number generation, and Fourier transform functions.<\/li>\n<li class=\"graf graf--li\">A C API for integrating code written in languages such as C, C ++, Fortran.<\/li>\n<\/ul>\n<p class=\"graf graf--p\">NumPy is especially important for numerical calculations because it can efficiently process large arrays of data. This is because:<\/p>\n<ul class=\"postList\">\n<li class=\"graf graf--li\">NumPy arrays use less memory than Python\u2019s built-in sequences.<\/li>\n<li class=\"graf graf--li\">NumPy can perform complex calculations on entire arrays without the need the For loop of Python.<\/li>\n<\/ul>\n<p class=\"graf graf--p\">Besides, <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/matplotlib.org\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/matplotlib.org\/\">Matplotlib<\/a> and <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/seaborn.pydata.org\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-href=\"https:\/\/seaborn.pydata.org\/\">Seaborn<\/a> are the main visualization tools of Python. It is recommended that everyone learn to learn. Data display and data analysis are equally important.<\/p>\n<p class=\"graf graf--p\">Alright, that \u2019s it for today \u2019s introduction of data analysis tools. If you want a more comprehensive getting started guide for data analysis, you can refer to the following articles:<\/p>\n<p><a href=\"https:\/\/www.finereport.com\/en\/data-analysis\/data-analysis-practice-guide-how-to-begin.html\"><strong class=\"markup--strong markup--blockquote-strong\">Data Analysis Practice Guide: How to Begin?<\/strong><\/a><\/p>\n<p><a href=\"https:\/\/www.finereport.com\/en\/data-analysis\/6-key-skills-that-data-analysts-need-to-master.html\"><strong class=\"markup--strong markup--blockquote-strong\">6 Key Skills That Data Analysts Need to Master<\/strong><\/a><\/p>\n<p><a href=\"https:\/\/www.finereport.com\/en\/data-analysis\/what-data-analysis-tools-should-i-learn-to-start-a-career-as-a-data-analyst.html\"><strong class=\"markup--strong markup--blockquote-strong\">What Data Analysis Tools Should I Learn to Start a Career as a Data Analyst?<\/strong><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>A guide to help beginners quickly get started with data analysis!<\/p>\n","protected":false},"author":1,"featured_media":9005,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[161],"tags":[151],"_links":{"self":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts\/9004"}],"collection":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/comments?post=9004"}],"version-history":[{"count":3,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts\/9004\/revisions"}],"predecessor-version":[{"id":11117,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/posts\/9004\/revisions\/11117"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/media\/9005"}],"wp:attachment":[{"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/media?parent=9004"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/categories?post=9004"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/frg.fineres.com\/en\/wp-json\/wp\/v2\/tags?post=9004"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}