Paper title
"Playing the whole game": A data collection and analysis exercise with Google Calendar
Paper authors
Abstract
We provide a computational exercise suitable for early introduction in an undergraduate statistics or data science course that allows students to 'play the whole game' of data science: performing both data collection and data analysis. While many teaching resources exist for data analysis, such resources are not as abundant for data collection given the inherent difficulty of the task. Our proposed exercise centers around student use of Google Calendar to collect data with the goal of answering the question 'How do I spend my time?' On the one hand, the exercise involves answering a question with near universal appeal, but on the other hand, the data collection mechanism is not beyond the reach of a typical undergraduate student. A further benefit of the exercise is that it provides an opportunity for discussions on ethical questions and considerations that data providers and data analysts face in today's age of large-scale internet-based data collection.