STAT 240 - Fall 2025
Designed Experiment: A study where the researcher controls the conditions under which observations are taken and imposes treatments on individuals in order to observe the responses.
Observational Study: A study where the independent variable is not controlled by the researchers.
Take two plots of wheat
Plot A is a standard species
Plot B is genetically modified
Experimental Unit (EU): The smallest unit to which a treatment is independently assigned/applied.
Observational Unit (OU): The smallest unit on which observations are made.
Treatment (Trt): Experimental condition applied to experimental unit.
Factor: A controlled independent variable; a variable whose levels are set by the experimenter.
Response: The thing we measure to determine the effect of the treatments
Researchers want to understand the difference between two different species of clover
They plant both breeds in equal sizes plots
Each plot is watered/fertilized/cared for the exact same
After 15 days the researchers harvest half of each plot, then the other half after 30 days
Each harvest is dried and weighed in bulk
The “signal” or information in experimental data can be very clear
We can get rid of messy components of processes
We can control for things we don’t want
Two individuals are determined to be similar in the ways that are important to a study, one is given treatment A and the other is given treatment B.
Twin studies
Genetically identical crops / mice
Topographically identical locations?
Each individual is given both treatments with randomized order and assignment of order to avoid bias.
If observing variable A doesn’t change our observation of variable B
We observe that fish in a stream near Manhattan are all purple
If our data was generated from a series of dependent processes
We have to do something to adjust for that
The guesses on bag prices might still have an accurate variance
But we have to consider the shift in guesses
If our data is from an independent process we can work with it as it is
Each individual in the experiment much be worked with identically
Imagine if you performed a study on two groups of rats
Both are run through a maze and timed on speed of completion
If I can’t repeat your experiment using your exact same steps and get similar results
That’s not a good experiment
Your results are likely invalid
You’ve potentially committed academic fraud
Replication in observational studies
All experiments, studies, and models are crude representations of reality
If a study is making strong claims like:
Our experiment perfectly captures the biological processes
These results display the exact expected outcomes in nature
Double check their work
Idaho, Montana, and Wisconsin histotically have an issue with wolves killing cattle and sheep on ranches. The states keep track of every wolf that’s killed by farmers or hunters as well as every cow/sheep that’s killed by wolves.
Over time they found that as more wolves were killed more cattle and sheep were killed.
Does this make sense?
What could be causing this result?
Confounding: Two variables in a study/experiment with relatively indistinguishable effects.
Collinearity: Variables are so heavily confounded that we cannot separate the information in one from another at all, they’re 1:1 paired.
We have to adjust for these problems in our design and analysis
Experiments can prevent both of these problems if we design well
Systematic bias
One outcome is systematically preferred over others
An improperly callibrated scale
A poorly designed survey
Inconsistent treatment of subjects
Unlike data fraud, this isn’t always intentional
Unlike random error, this is always preventable
If you’re conducting a trial to determine the impact of a social welfare program, should you reward your participants?
Poor/homeless individuals are easy to exploit
Paying people for their time and effort is how our economic system functions
You acquire \(3000\) patients for an experimental HIV vaccine trial and provide \(100\) of them a placebo.
Is it wrong to administer a placebo?
You’ve created a clear effect of treatment and eliminated a lot of confounding variables. Isn’t that good?
You want to determine the pest resilience of a strain of GMO corn. Your experiment is industry sponsored and is taking place at an industry owned farm, directly adjacent to a family owned farm. Your crop spills over to the family owned farm and changes the genetic makeup of their crop.
Have you harmed or helped them?
Do you owe them compensation?
Are they stealing your intellectual property by selling their crop now?