STATISTICS with R

A comprehensive guide to statistical analysis in R

Data sets for Practice

Hands‑on practice is essential for learning statistics and developing real data‑analysis skills. This page provides curated practice data sets to help you apply statistical methods in R, strengthen your understanding, and build confidence through real‑world analysis.

Data Description Variables Suggested Modeling
Brain tumor. A study on the effectiveness of radiosurgery treatment of primary brain tumor patients. sex: Male / Female
diagnosis: tumor type
location: part of brain
KI: Karnofsky index
GTV: Gross Tumor Volume
Treatment method: Radiosurgery method
status: survival status
time: survival time
Kaplan-Meier Curve
Cox regression
Pima Indians Diabetes is a study on development of diabetes among female Pima ethnic group. Pregnancies: Number of pregnancies
Glucose: Glucose level
BloodPressure: Blood pressure
SkinThickness: Skin thickness
Insulin: Insulin level
BMI: Body Mass Index
DiabetesPedigreeFunction: Diabetes Pedigree Function
Age: Age
Daibetes: Diabetes status
Logistic regression
Multiple regression
Arsenic level. A study that compares arsenic level in water pipes with an acceptable level. Arsenic_level: Arsenic level in water pipes One-sample t-test
Body fat. Estimating body fat by knowing circumference measures of different body parts. Penrose et al (1985) BodyFat: Body fat
Age: Age
Circumference: Different body parts
Multiple regression
Breast cancer. Classification of breast cancers using cell measurements. Mangasarian and Wolberg (1990) type: Breast cancer diagnosis Logistic regression
Cardamom. Does cardamom have any effect on blood pressure? Blood pressure: Before and After Paired Samples t-test
Repeated measures ANOVA
Exercisse and optimism. Research on the relationship between exercise and optimism. Exercise_Freq: Frequency of exercise per week
Optimism_Level: Optimism outlook
Kendall tau b
Breast tumor. A randomized trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients in Germany. Schumacher et al (1994). Recurrence Free Survival Time [days]
Recurrence Event
Hormonal Therapy
Tumor Grade
Kaplan-Meier Curve
Cox regression
Study hours. A study on the relationship between hours of study and exam scores. Study hours
Test scores
Pearson correlation
Simple regression
Weight loss A research study on the relationship between exercise and weight loss. Hours of weekly exercise
Weight loss (lb)
Spearman correlation
Teaching methods. A research study on the effectiveness of two math teaching methods. Group: teaching method
Math score
Independent samples t-test
One-way ANOVA
Recurrent gliomas. A research study on the survival time of recurrent malignant Gliomas patients. Rostomily et al (1994) Glioma_Type
Status: survived or censored
Time_Week: time in weeks to event
Kaplan-Meier Curve
Cox regression
Physical therapy methods. A research study on the effectiveness three physical therapy methods on recovery time. Therapy method
Recovery time (days)
One-way ANOVA
Physical therapy methods and knee injury severity. Investigating the interaction effect of physical therapy methods and injury severity on recovery time. Therapy method
Knee injury severity
Recovery time (days)
Two-way ANOVA
Math anxiety. Does providing a school-based yoga program to school children reduce their math anxiety? Math anxiety score
Repeated Measures ANOVA
Sleep position. Is there a relationship between Sleep position (sleeping on side versus on back) and Backache complaints? Sleep position: on back, on side
Backache status
Chi-squared test
Logistic regression
Study hours, Motivation, Test Scores. What is the relationship between the number of hours students study, students’ academic motivation and their test scores? Study hours
Motivation
Test scores
Multiple regression
Urine analysis. What urine characteristics can predict the presence of calcium oxalate crystals? Andrews et al (1985) r: binary indicator of calcium oxalate crystals; gravity: urine density; ph; osmo: Osmolarity; cond: conductivity; urea: urea (mmol/L); calc: calcium (mmol/L) Logistic regression
Conception after laparoscopy. What is probability of conception in patients after undergoing laparoscopy and hydrotubation? Luthra et al (1982) Months: months to conception
Conception: conceived or not
Kaplan-Meier Curve

Scroll to Top