{"id":2310,"date":"2020-10-28T09:03:10","date_gmt":"2020-10-28T02:03:10","guid":{"rendered":"http:\/\/international.binus.ac.id\/computer-science\/?p=2310"},"modified":"2020-10-28T09:03:10","modified_gmt":"2020-10-28T02:03:10","slug":"what-is-data-science","status":"publish","type":"post","link":"https:\/\/international.binus.ac.id\/computer-science\/2020\/10\/28\/what-is-data-science\/","title":{"rendered":"What is Data Science"},"content":{"rendered":"<p>So you want to be a \u201cdata scientist\u201d?<\/p>\n<p>There is no widely accepted de\ufb01nition of who a data scientist is.<\/p>\n<ol>\n<li>Several books now attempt to de\ufb01ne what data science is and who a data scientist,<\/li>\n<li>It is likely to be an individual with multi-disciplinary training in computer science, business, economics, statistics, and armed with the necessary quantity of domain knowledge relevant to the question at hand. The potential of the \ufb01eld is enormous for just a few well-trained data scientists armed with big data have the potential to transform organizations and societies. In the narrower domain of business life, the role of the data scientist is to generate applicable business intelligence.<\/li>\n<\/ol>\n<p>Data science is transforming business. Companies are using medical data and claims data to offer incentivized health programs to employees. Caesar\u2019s Entertainment Corp. analyzed data for 65,000 employees and found substantial cost savings. Zynga Inc, famous for its game Farmville, accumulates 25 terabytes of data every day and analyzes it to make choices about new game features. UPS installed sensors to collect data on speed and location of its vans, which combined with GPS information, reduced fuel usage in 2011 by 8.4 million gallons, and shaved 85 million miles off its routes. 5 McKinsey argues that a successful data analytics plan contains three elements: interlinked data inputs, analytics models, and decision-support tools. 6 In a seminal paper, Halevy, Norvig and Pereira (2009), argue that even simple theories and models, with big data, have the potential to do better than complex models with less data.<\/p>\n<p>In a recent talk 7 well-regarded data scientist Hilary Mason emphasized that the creation of \u201cdata products\u201d requires three components: data (of course) plus technical expertise (machine-learning) plus people and process (talent). Google Maps is a great example of a data product that epitomizes all these three qualities. She mentioned three skills that good data scientists need to cultivate: (a) in math and stats, (b) coding, (c) communication. I would add that preceding all these is the ability to ask relevant questions, the answers to which unlock value for companies, consumers, and society. Everything in data analytics begins with a clear problem statement, and needs to be judged with clear metrics.<\/p>\n<p>Being a data scientist is inherently interdisciplinary. Good questions come from many disciplines, and the best answers are likely to come from people who are interested in multiple \ufb01elds, or at least from teams that co-mingle varied skill sets. Josh Wills of Cloudera stated it well \u201cA data scientist is a person who is better at statistics than any software engineer and better at software engineering than any statistician.\u201d In contrast, complementing data scientists are business analytics people, who are more familiar with business models and paradigms and can ask good questions of the data.<\/p>\n<p>&nbsp;<\/p>\n<p>References:<\/p>\n<div class=\"gs_citr\">DAS, SANJIV RANJAN, &#8220;Data science: theories, models, algorithms, and analytics&#8221;, S. R. Das, 2016.<\/div>\n","protected":false},"excerpt":{"rendered":"<p>So you want to be a \u201cdata scientist\u201d? There is no widely accepted de\ufb01nition of who a data scientist is. Several books now attempt to de\ufb01ne what data science is and who a data scientist, It is likely to be an individual with multi-disciplinary training in computer science, business, economics, statistics, and armed with the [&hellip;]<\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[112],"tags":[],"class_list":["post-2310","post","type-post","status-publish","format-standard","hentry","category-article"],"_links":{"self":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts\/2310"}],"collection":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/comments?post=2310"}],"version-history":[{"count":1,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts\/2310\/revisions"}],"predecessor-version":[{"id":2311,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts\/2310\/revisions\/2311"}],"wp:attachment":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/media?parent=2310"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/categories?post=2310"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/tags?post=2310"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}