The Data Science: Productivity Tools course will help you keep your projects organized and produce reproducible reports using GitHub, git, Unix/Linux, and RStudio. A typical data analysis project can have many components, each with its own set of data files and scripts. Keeping all of this organized is difficult.
In this course, part of our Professional Certificate Program in Data Science, you will learn how to use Unix/Linux to manage the files and directories on your computer. You will be introduced to git, a helpful tool for keeping track of changes in your scripts and reports. In addition, you'll learn about GitHub and see how it can be used to store your work in a way that encourages teamwork.
Finally, you will learn how to write reports in R Markdown, which allows you to incorporate both text and code into a document. RStudio, a powerful integrated development environment, will be used to bring it all together.
What you’ll learn
- How to use Unix/Linux to manage your file system
- How to perform version control with git
- How to start a repository on GitHub
- How to leverage the many useful features provided by RStudio
Honor code statement
HarvardX students who enroll in edX courses are bound by the terms of the edX honor code. HarvardX may respond to violations of the edX honor code with removal from the HarvardX course, revocation of any HarvardX course certificates, or other remedies. In the event of remedial action, no refunds will be given. Students who take HarvardX courses as part of another program are also subject to the academic policies of that program.
Research statement
By registering as a student in one of HarvardX's open online courses, you also take part in research aimed at improving HarvardX's educational offerings and the quality of education and related sciences worldwide. For research purposes, you may be shown course materials that differ from those shown to other learners. Learner data collected through HarvardX is used only for Harvard's stated teaching and research goals. Researchers outside of Harvard may access information gathered during online learning activities, including Personally Identifiable Information. Only the minimum amount of information required to carry out the research will be shared, and it will be protected by an agreement that stipulates how the information may be used. Aggregated data may also be shared with the public or third parties without your permission. Your personal identity will not be revealed in any published research results.
Read the edX Privacy Policy for additional information about how your data is processed, transmitted, and used.
Statement against harassment and discrimination
Harvard University and HarvardX are committed to a healthy and safe learning and working environment in which no one is excluded from, denied the benefits of, or harassed in our program. We will not tolerate any form of discrimination or harassment in our program. edX's policies on nondiscrimination and sexual harassment, as well as Harvard's own, apply to all members of the HarvardX community. Please email [email protected] or use the edX contact form if you have any questions or concerns.