known words
currently learning
terms to unify
regex to split segments
Highlight top 33% of unknown words
Text Input