STATISTICS with R
A comprehensive guide to statistical analysis in R
Data sets for Practice
Hands‑on practice is essential for learning statistics and developing real data‑analysis skills. This page provides curated practice data sets to help you apply statistical methods in R, strengthen your understanding, and build confidence through real‑world analysis.
| Data Description | Variables | Suggested Modeling |
|---|---|---|
| Brain tumor. A study on the effectiveness of radiosurgery treatment of primary brain tumor patients. |
sex: Male / Female diagnosis: tumor type location: part of brain KI: Karnofsky index GTV: Gross Tumor Volume Treatment method: Radiosurgery method status: survival status time: survival time |
Kaplan-Meier Curve Cox regression |
| Pima Indians Diabetes is a study on development of diabetes among female Pima ethnic group. |
Pregnancies: Number of pregnancies Glucose: Glucose level BloodPressure: Blood pressure SkinThickness: Skin thickness Insulin: Insulin level BMI: Body Mass Index DiabetesPedigreeFunction: Diabetes Pedigree Function Age: Age Daibetes: Diabetes status |
Logistic regression Multiple regression |
| Arsenic level. A study that compares arsenic level in water pipes with an acceptable level. | Arsenic_level: Arsenic level in water pipes | One-sample t-test |
| Body fat. Estimating body fat by knowing circumference measures of different body parts. Penrose et al (1985) |
BodyFat: Body fat Age: Age Circumference: Different body parts |
Multiple regression |
| Breast cancer. Classification of breast cancers using cell measurements. Mangasarian and Wolberg (1990) | type: Breast cancer diagnosis | Logistic regression |
| Cardamom. Does cardamom have any effect on blood pressure? | Blood pressure: Before and After |
Paired Samples t-test Repeated measures ANOVA |
| Exercisse and optimism. Research on the relationship between exercise and optimism. |
Exercise_Freq: Frequency of exercise per week Optimism_Level: Optimism outlook |
Kendall tau b |
| Breast tumor. A randomized trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients in Germany. Schumacher et al (1994). |
Recurrence Free Survival Time [days] Recurrence Event Hormonal Therapy Tumor Grade |
Kaplan-Meier Curve Cox regression |
| Study hours. A study on the relationship between hours of study and exam scores. |
Study hours Test scores |
Pearson correlation Simple regression |
| Weight loss A research study on the relationship between exercise and weight loss. |
Hours of weekly exercise Weight loss (lb) |
Spearman correlation |
| Teaching methods. A research study on the effectiveness of two math teaching methods. |
Group: teaching method Math score |
Independent samples t-test One-way ANOVA |
| Recurrent gliomas. A research study on the survival time of recurrent malignant Gliomas patients. Rostomily et al (1994) |
Glioma_Type Status: survived or censored Time_Week: time in weeks to event |
Kaplan-Meier Curve Cox regression |
| Physical therapy methods. A research study on the effectiveness three physical therapy methods on recovery time. |
Therapy method Recovery time (days) |
One-way ANOVA |
| Physical therapy methods and knee injury severity. Investigating the interaction effect of physical therapy methods and injury severity on recovery time. |
Therapy method Knee injury severity Recovery time (days) |
Two-way ANOVA |
| Math anxiety. Does providing a school-based yoga program to school children reduce their math anxiety? |
Math anxiety score |
Repeated Measures ANOVA |
| Sleep position. Is there a relationship between Sleep position (sleeping on side versus on back) and Backache complaints? |
Sleep position: on back, on side Backache status |
Chi-squared test Logistic regression |
| Study hours, Motivation, Test Scores. What is the relationship between the number of hours students study, students’ academic motivation and their test scores? |
Study hours Motivation Test scores |
Multiple regression |
| Urine analysis. What urine characteristics can predict the presence of calcium oxalate crystals? Andrews et al (1985) | r: binary indicator of calcium oxalate crystals; gravity: urine density; ph; osmo: Osmolarity; cond: conductivity; urea: urea (mmol/L); calc: calcium (mmol/L) | Logistic regression |
| Conception after laparoscopy. What is probability of conception in patients after undergoing laparoscopy and hydrotubation? Luthra et al (1982) |
Months: months to conception Conception: conceived or not |
Kaplan-Meier Curve |