2 RStudio Projects
RStudio Projects offer a powerful way to organize data analysis work. Think of a single folder that contains everything we need, from imported data to final visualizations. This self-contained structure promotes clear project organization and facilitates seamless collaboration with others. In this chapter, we explore how to create, manage, and make the most of RStudio Projects, while also learning how to write and save R code in scripts.
2.1 Working with RStudio Projects
One of the advantages of RStudio IDE is that allows us to work with RStudio Projects. The RStudio Projects are recommended for the following main reasons:
When working in R, the program needs to know where to find input files (e.g., datasets) and where to save output files (e.g., R scripts, figures). It first looks into the working directory. When an R session is initiated through a project file (.Rproj), the working directory is automatically set to the project’s root folder.
RStudio Project is a powerful feature that enables users to organize all their files and switch between different projects and tasks without mixing up datasets, code scripts, or output files.
2.1.1 Creating an RStudio Project
Let’s create our first RStudio Project to use for the rest of this textbook. From the RStudio menu bar (FIGURE 2.1) we select:
File New Project...
Alternatively, we can click the plus project icon
in the main RStudio menu, or select New Project... from the top-right Project menu (FIGURE 2.2):
Then, we follow the steps in FIGURE 2.3 - FIGURE 2.5.
In Step 3 (FIGURE 2.5) the directory name that we type will be the project’s name. We call it whatever we want, for example “my_first_project”.
Once we have completed this process, R session switches to the new RStudio Project with the name “my_first_project” (FIGURE 2.6).
2.1.2 Organizing an RStudio Project folder
An RStudio Project folder can be organized and managed just like the folders and subfolders on a computer. For our purposes, it is sufficient to consider a simple RStudio Project folder structure that contains two subfolders: data (for saving data files of any kind, such as .csv, .xlsx, and .txt) and figures (for saving plots, diagrams, and other charts), as shown schematically below:
my_first_project/
├── data
└── figures
Using the
button in RStudio’s Files pane, we can easily create new subfolders within the main RStudio Project directory. FIGURE 2.7 demonstrates how to create the “data” subfolder.
To add the “figures” subfolder, we simply repeat the same steps (FIGURE 2.8):
Therefore, we end up to the following RStudio Project folder structure (FIGURE 2.9):
INFO
The file named
2.2 Opening a new R script
Typically, R code is written in script files denoted by the .R extension. An R script is a plain text file containing R code that can be executed in the R console. The use of R script files offers various benefits, including the ability to execute code in chunks rather than individual lines, which enhances code reuse and organization. Furthermore, R script files enable easy documentation through one-line comments prefixed with the hash symbol (#), and support efficient code sharing with others.
To open an R script in RStudio (FIGURE 2.10), we navigate to the menu bar at the top of the RStudio window and follow these steps:
File New File R Script
Alternatively, we can use the plus icon
from RStudio toolbar or the keyboard shortcut Ctrl+Shift+N for Windows/Linux or Cmd+Shift+N for Mac.
Another pane, the “Source Editor”, is opened on the left above the Console pane (FIGURE 2.11). In the Source Editor, we can write a lengthy script with numerous code chunks and save the file for future use. By default, the first unsaved R script in an R session is named “Untitled 1”.
We can change the size of the panes by either clicking the minimize or maximize buttons on the top right of each pane, or by clicking and dragging the middle of the borders of the panes. Additionally, the four panes can be arranged in a different order from that shown in FIGURE 2.11. If desired, we can change where each pane appears within RStudio by adjusting the preferences (FIGURE 2.12). We select from RStudio menu:
Tools Global Options Pane layout
Now, let’s type 14 + 16 at a new R script in the Source Editor pane and press the
button located at the top of the Source Editor. The result is printed in the Console (FIGURE 2.13):
INFO
In an R script, we can execute code line by line by placing the cursor on a line and pressing the Run button in the Source Editor. Alternatively, we can run a selected chunk of code by highlighting it and then clicking Run. The selected code can also be run using the keywboard shortcut Ctrl+Enter on Windows/Linux or Cmd+Enter on Mac.
2.3 Adding comments and saving a script
Comments can also be used to explain R code, and to make the script more readable. They can also be used to prevent execution when testing alternative code (FIGURE 2.14).
In R, comments begin with the hash symbol (#). When the code is executed, anything following the # is ignored by R. Including comments in code when writing R scripts is considered good practice.
INFO
Keyboard Shortcut for commenting in/out at a time:
Ctrl+Shift+C for Windows/Linux or Cmd+Shift+C for Mac .
Finally, we can save our R script in the RStudio Project folder. The simplest way is to click on the save icon
, give a file name to the script such as “my_script” and then press the “save” button to store it in “my_first_project” folder (FIGURE 2.15).
Now, the folder structure of our RStudio Project should include the following items (FIGURE 2.16):
INFO
The .Rhistory file, created automatically by RStudio, contains a history of the code that has been executed.
If we close the pane containing the R script, we can reopen it by clicking on the “my_script.R” file in the “Files” tab.