Monday, June 8, 2015

Wisdom from the wellspring: attending a Central Texas DITA Users' meetup

I've been meaning to attend a Central Texas DITA Users' meetup for ages, having lurked on their message boards for years. And I finally went!

"DITA users' group" doesn't begin to describe it--"creators-of-the-DITA-OT, there-from-the-beginning-and-still-fighting-the-good-fight users' group" might start to cover it.

Wednesday, June 3, 2015

Baby steps toward a Bayes classifier

Actually, the blog title is a misnomer--I started trying to code up a naive Bayes classifier while studying chapter 6 of the Python NLTK book, but I didn't get very far before I switched over to a tutorial on the scikit-learn random forest classifier, because it aims at building something instead of demonstrating little building blocks.

I haven't gotten far, but at this point I can create a feature set from a cleaned-up 'bag of words' using scikit-learn.
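To make "feature set from a bag of words" concrete, here is a minimal sketch of the idea in plain Python. It is not the code from the tutorial, and it does not call scikit-learn (whose CountVectorizer class does this job in practice). The function names, the max_features parameter, and the two sample reviews are all made up for illustration. The sketch assumes the text has already been cleaned up into lowercase words separated by spaces.

```python
from collections import Counter


def build_vocabulary(docs, max_features=None):
    """Collect the most frequent words across all documents (hypothetical helper)."""
    counts = Counter(word for doc in docs for word in doc.split())
    most_common = [word for word, _ in counts.most_common(max_features)]
    # Sort alphabetically so each word gets a stable column position.
    return sorted(most_common)


def to_feature_vectors(docs, vocabulary):
    """Turn each document into a list of word counts, one column per vocabulary word."""
    index = {word: i for i, word in enumerate(vocabulary)}
    vectors = []
    for doc in docs:
        row = [0] * len(vocabulary)
        for word in doc.split():
            if word in index:  # words outside the vocabulary are ignored
                row[index[word]] += 1
        vectors.append(row)
    return vectors


# Made-up sample data standing in for the cleaned-up text.
cleaned_reviews = ["great movie great fun", "dull plot dull movie"]

vocabulary = build_vocabulary(cleaned_reviews)
features = to_feature_vectors(cleaned_reviews, vocabulary)

print(vocabulary)  # ['dull', 'fun', 'great', 'movie', 'plot']
print(features)    # [[0, 1, 2, 1, 0], [2, 0, 0, 1, 1]]
```

Each row is one document and each column is one vocabulary word, so word order is thrown away and only the counts survive. That table of numbers is the kind of input a classifier can be trained on.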



Next up: training the random forest! (I love the terminology involved, by the way. "Training the random forest" sounds like a line from a nonsense poem.)