課程編碼 Course Code | 中文課程名稱 Course Name (Chinese) | 英文課程名稱 Course Name (English) | 總學分數 Credits | 總時數 Hours |
---|---|---|---|---|
4235055 | 數據科學概論 | Introduction to Data Science | 3.0 | 3 |
中文概述 Chinese Description | 數據科學是基於數據並由數據驅動的一門科學範疇,雖然自有科學以來,實證研究莫不仰賴數據作為基礎,但是卻從沒有一個時代像如今數據的產生如此巨量、數據的傳輸如此快速、數據的儲存價格如此低廉 和數據的分析能力如此強大,因此數據科學已經成為一個新的典範,並成為維基百科所引述的:「數據科學是一個整合統計、數據分析、機器學習及其相關方法的概念,以便用數據理解和分析實際現象」(原始出處見https://en.wikipedia.org/wiki/Data_science)。為提供學生關於數據科學的概觀和練習,在這門概論課程中,我們將介紹數據科學的眾多面向,並利用電腦軟體來探討數據的儲存、彙整、梳理、取樣、建模、探勘、視覺化、統計分析和機器學習等課題。 | |||
英文概述 English Description | Data science is a field of science that is based on data and driven by data. Although evidence-based scientific research has always relied on data as its foundation, there has never been an era that data volume is so large, data transmission is so speedy, data storage is so cheap, and data analysis is so advanced. Therefore, data science has become a new paradigm in the current era. According to the statement quoted by Wikipedia, data science is a "concept to unify statistics, data analysis, machine learning and their related methods" in order to "understand and analyze actual phenomena" with data (see https://en.wikipedia.org/wiki/Data_science for the original source). In order to provide students with an overview and hands-on practice of data science, we will present the many aspects of data science in this introductory course, and use computer software to explore various topics such as data storage, consolidation, cleaning, sampling, modeling, exploration, visualization, statistical |
備註: