Experiment Design in Computer Science


Claus Aranha, Tetsuya Sakurai

Experiment Design, Statistical Testing, Statistics, Philosophy of Science, Ethics, R Language 
No particular requirements for registration. This course expects students to be familiar with basic statistical concepts (mean, median, standard deviation, population, random variables, distributions, etc.). The experiments in the course use the R language, so familiarity with it is helpful, and we encourage students to study R in preparation for the course.
At the end of the course, students should be able to:

The collection and analysis of data through experiments is one of the cornerstones of the scientific method. In this course, we study the general philosophy and methods behind experimentalism: why we perform experiments, what makes an experiment good and rigorous, how to plan and design a rigorous experiment, and how to perform statistical analysis on experimental data. The course is centered around lectures with plenty of examples and case studies. Students will be invited to apply the techniques studied in the lectures to experiments of their own design.
The grade is based on three individual reports. Each report is a case study in which the student must design an experiment to answer a scientific question, obtain experimental data (from the instructor, or by performing an experiment), analyze the data using the tools studied in the lectures, and prepare a report discussing the experimental conclusions. The report is evaluated on the quality of the experiment design, the correctness of the statistical analysis, and the quality of the discussion of the results (note that whether the results are positive or negative does not factor into the grade of the report).

Lecture in English. I expect students to contribute questions and discussion during class. Additionally, I recommend that students use data from their own research to prepare the reports. Students who cannot provide their own data will be given sample data; however, you will learn much more if you can use your own data.