Defining variables, or giving a name to values, such as the numbers you added together, allows you to store information.
You define variables using the <-
operator, which means “store the value on the right in the variable on the left”, like so. You can then call it back by saying the variable name.
variable1 <- 3
variable2 <- 4
variable1
[1] 3
You can use variables in the place of the numbers they hold, and perform mathematical operations on them.
variable1 + variable2
[1] 7
As their name implies, variables can take on many values, and can change, so be careful with naming and assignment.
variable1 <- variable1 + variable2
variable1
[1] 7
variable1 <- variable1 + variable2
variable1
[1] 11
These types of variables are numeric. However, R can also store text. These are in the form of characters. Surround characters in quotation marks to have them be character and not variable names.
char <- "a"
char
[1] "a"
# In your console, try typing 'char <- a' (no quotes) and seeing what happens
(char <- a
results in an error, because a
without quotes is treated as a variable name, and that variable has not been assigned yet.)
Characters variables can store strings, such as words and sentences (in fact, in R, string and character are basically interchangeable). In R, one string (everything between the quotation marks) is stored as a single unit.
char_sentence <- "My favorite food is ice cream"
char_sentence
[1] "My favorite food is ice cream"
char_variable1 <- "variable1"
char_variable1
[1] "variable1"
char_variable2 <- variable1
char_variable2
[1] 11
You can identify the type of variable (numeric, character) using is()
.
# Using is()
is(5)
[1] "numeric" "vector"
# Try identifying the type of variable for char_variable1
is(char_variable1)
[1] "character" "vector" "data.frameRowLabels" "SuperClassMethod"
# How about char_variable2?
is(char_variable2)
[1] "numeric" "vector"
# What is going on?
Notice that the thing we assigned into char_variable1 was “variable_1” in quotes, which is just a character string, like any other, hence it’s listed as a “character” (don’t worry about the rest of this output for now). On the other hand, the thing we assigned into char_variable2 was variable_1 without quotes, which copies the contents of the variable_1 variable and stores that copy in char_variable2. We earlier assigned variable_1 to hold a number, hence char_variable2 is now also a numeric type.
You can also make numbers into strings: just put a quote around them!
num_string <- '4'
actual_num <- 4
# Try identifying the types of num_string and actual_num
is(num_string)
[1] "character" "vector" "data.frameRowLabels" "SuperClassMethod"
is(actual_num)
[1] "numeric" "vector"
You need to be very careful about the types of your variables. It may look like actual_num and num_string are holding the same thing, but because we’ve implicitly told R that they are of different types, they behave very differently. In the Console, try adding num_string to actual_num. What happens, and why? What is R’s cryptic message trying to tell you?
actual_num + num_string
results in an error, described as ‘non-numeric argument to binary operator’; this means you’re trying to perform an operation that is done on numbers (addition) with something that’s not a number (num_string
)
We saw above that you can do basic arithmetic in R by just using mathematical symbols, e.g. +
. But for more complicated things, programming languages use things called functions. A function is something that wraps a whole set of operations so that you can invoke those operations in a single command. You have already used another function earlier in this class, is()
. Let’s take a look at another example: there’s a function, sum()
, that can add numbers, just like +
can. Let’s try it:
variable1
[1] 11
variable2
[1] 4
sum(variable1, variable2)
[1] 15
sum()
can also take more than two inputs!
# Try using the sum function to compute the sum of 3, 4, and 6
sum(3,4,6)
[1] 13
Functions are their own type! You can try using is()
to check this:
is(sum)
[1] "function" "OptionalFunction" "PossibleMethod"
is(is)
[1] "function" "OptionalFunction" "PossibleMethod"
Functions are always followed by ()
when you use them. When you use a function, often you are telling it to perform an operation on something else (e.g. a variable); in the examples above, we’re performing the sum()
function on variable1 and variable2, and we’re performing the is()
function on sum
and on itself. These are called arguments, and are passed to the function inside the parentheses.
Some functions don’t need to take any arguments. For example, you can find out what variables you have loaded in your environment at anytime using ls()
. This will list all your variables, any custom functions you may have written, etc.
ls()
[1] "actual_num" "char" "char_sentence" "char_variable1" "char_variable2" "num_string"
[7] "variable1" "variable2"
Check out the Environment window on the top right! You can actually also see all your variables loaded in there, with a short summary of what’s in them. This is a super convenient feature of programming in Rstudio.
Another really useful function is the paste
function; this function allows you to combine strings.
favorite_subject <- 'biology'
statement_about_subject <- 'is #'
subject_rank <- '1'
biology_quality <- paste(favorite_subject, statement_about_subject, subject_rank)
Notice that the code above didn’t print anything out. That’s because we directed the output of the paste
function into a variable, biology_quality. To get R to print out the current program variable, you could just type it into your console, or you could use a function! Try googling to find an R function that prints out a variable.
# Find an R function to print out a variable, and use it to print whatever
# the biology_quality variable is storing
print(biology_quality)
[1] "biology is # 1"
You can learn about a function by reading its documentation. You can access the documentation with ?
, for example, type ?paste
into your console. You will see the help text pop up in the bottom right Help box in Rstudio.
?
is one of the more important keys for you to know. Seriously, many issues are averted by first checking the documentation. Documentation usually has:
In practice, this information can sometimes be a bit dense and technical. I usually find it most useful to read the ‘description’ on top, and then scroll all the way down to the ‘examples’ section at the bottom of the help function.
Notice that when we used paste above, the function automatically put a space between our two strings. It’s often really useful to combine strings with a different character, for example, a dash. Try to read the documentation for paste()
to figure out how to do this.
# Combine only the favorite_subject, statement_about_subject, and subject_rank
# variables, using a dash between them instead of a space
biology_quality_dash <-
paste(favorite_subject, statement_about_subject, subject_rank, sep = '-')
print(biology_quality_dash)
[1] "biology-is #-1"
sum()
, paste()
, is()