Data Analysis
Candidate | Agreement score |
---|
Classifier Explain

The classifier (multi-label) is a DNN-optimized machine learning
model that we created using the Google AutoML Natural Language
Processing and Google Cloud APIs. We fed the model large sets of
data- sentences and phrases that each candidate has spoken in
public. The data was sourced from interviews, town halls, and
twitter as these sources generally contain unscripted views of the
politicians, important for mitigating input bias.
Look at the graphs above. The confidence index refers to the level
of confidence the model must have to assign a category to a test
item. Precision and recall help us understand how well our model
is capturing information, and how much it's leaving out. Precision
tells us, from all the test examples that were assigned a label,
how many actually were supposed to be categorized with that label.
Recall tells us, from all the test examples that should have had
the label assigned, how many were actually assigned the label.
Team
-
Rohan Sood
Machine Learning/Natural Language Processing
Contact Rohan -
Sam Wight
Wrote parts of the backend and helped out with the front-end.
Contact Sam -
Peter Nguyen
Front end structure and web effect design.
Contact Peter