The glossary is being gradually proof checked, but currently has many typos and misspellings.
Semi-supervised learning uses partially labeled data, that is where some of the training data is labelled with expected outputs/calssifications and some is unlabelled.