Data Wrangling

PDF vs HTML : Extracting information from a Google Form with R

A couple months ago, I had to analyse responses from a questionnaire. I needed to tidy the dataset downloaded from Google Forms as it did not respect the 10 commandments for a well-formated database. I wanted to import questions titles and answer choices into R but these metadata were not provided by the client. When I decided to get them, two choices were possible : either from the downloaded PDF file or from the HTML page of the form.

Continuer la lecture