 
 
    
      Tutorial: Analysing Categorical Data in R
    
    
    Marine
    Cadoret, Applied Mathematics Department,
    Agrocampus Ouest, France
    Sébastien
    Lê, Applied Mathematics Department,
    Agrocampus Ouest, France
     
    
      Abstract
    
    
      In many applications, people are interested in the
      relationship between categorical variables. The aim of this
      tutorial is to propose an overview of both descriptive and
      inferential methods for categorical data.
    
    
      Outline
    
    Topics will include: 
    
      - 
        Introduction to categorical data via genomic, sensory and
        ecological data and their specific problems.
      
- 
        Describing, analysing, and visualizing one categorical
        variable: graphical outputs, confidence interval for a
        proportion.
      
- 
        Describing, analysing, and visualizing two categorical
        variables: contingency table, Correspondence Analysis,
        Chi-square test. Application to textual data and open ended
        questions.
      
- 
        Describing, analysing, and visualizing several categorical
        variables: from Correspondence Analysis to Multiple
        Correspondence Analysis. Application to the analysis of
        multiple choice questionnaires: introducing and visualizing
        external information in the analysis.
      
- 
        Getting a typology of individuals described by several
        categorical variables: automatic description of groups of
        individuals obtained from a Hierarchical Ascending
        Classification. Introduction to Logistic Regression and
        Logit Models for multinomial responses.
      
- 
        Introduction to Logistic Regression and Logit Models for
        multinomial responses.
      
      Intended Audience
    
    Teachers in data mining and data analysis, researchers in
    applied fields, statisticians whose topic of interest is
    multivariate analysis of categorical data. 
    
      Prerequisites
    
    No prior knowledge is required. 
    
      Related Links
    
    More information will be available (scripts and datasets) at http://factominer.free.fr.	The information
	for the participants will be available on the day of the tutorial both on the webpage or
	at the tutorial itself (but unfortunately not before the day itself).
	
    
    
      References
    
    [1] R (and S-PLUS) Manual to Accompany Agresti's Categorical
    Data Analysis (2002) 2nd edition, Laura A. Thompson, 2009.