Linguistics Analysis: correct categorization of speech disfluency
Currently, speech parts such as "um" and "like" appear to be categorized as nouns or adverbs (depending on context). It would be great to detect these speech disfluencies. I have an idea for an app to help with feedback for public speaking.
Good feedback, thanks.
There is a part-of-speech tag devoted to such words UH"), and it is applied correctly to words such as "oh". It should be possible to apply this same tag to relatively unambiguous words such as "um". However, "like" will be more difficult because it has several distinct usages (e.g. verb in "I like the truffle oil", or preposition in "his dog is like a wolf").
I'll add it to the to-do list.