Lab 5a
I used my corpus of science fiction texts from Lab 3 for this experiment.
This is the list of topics MALLET produced from my corpus. Two of the topics consist of words associated with Project Gutenberg and its copyright and license notices. I consider those outliers since they have little to do with the actual stories. Interestingly, though, words associated with men and masculine titles show up in multiple topics.
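One possible way to keep the Gutenberg and copyright vocabulary out of the topics is to strip the boilerplate from each text before importing it into MALLET. This is not something I did for this lab; the sketch below only assumes that each file carries the usual `*** START OF THE PROJECT GUTENBERG EBOOK ... ***` and `*** END OF ...` marker lines. The function name `strip_gutenberg_boilerplate` is made up for illustration.

```python
import re

# Assumption: each text uses the common Project Gutenberg marker lines.
# Some files word these markers differently, so check a few by hand.
START = re.compile(
    r"^\*\*\*\s*START OF (?:THE|THIS) PROJECT GUTENBERG EBOOK.*$", re.MULTILINE
)
END = re.compile(
    r"^\*\*\*\s*END OF (?:THE|THIS) PROJECT GUTENBERG EBOOK.*$", re.MULTILINE
)


def strip_gutenberg_boilerplate(text):
    """Return only the text between the START and END marker lines.

    If a marker is missing, that side of the text is left untouched.
    """
    start = START.search(text)
    end = END.search(text)
    begin = start.end() if start else 0
    stop = end.start() if end and end.start() >= begin else len(text)
    return text[begin:stop].strip()


if __name__ == "__main__":
    sample = (
        "License header\n"
        "*** START OF THE PROJECT GUTENBERG EBOOK EXAMPLE ***\n"
        "Story text here.\n"
        "*** END OF THE PROJECT GUTENBERG EBOOK EXAMPLE ***\n"
        "License footer\n"
    )
    print(strip_gutenberg_boilerplate(sample))
```

Running the cleaned files through MALLET again would show whether the two outlier topics go away or whether other front matter takes their place.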
In several of the topics, a single work dominates the rest, particularly the longer stories like Rudyard Kipling’s _With the Night Mail_ and Ayn Rand’s _Anthem_.
This isn’t true of every topic, though. Some have a much more even split.
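The dominance of one work in a topic can also be checked with numbers instead of by eye. The sketch below assumes a document-topic table in the layout written by recent MALLET versions with `--output-doc-topics`: one tab-separated line per document holding an index, the document name, and then one proportion per topic in topic order. Older versions write a different layout, so this is an assumption to verify against the actual file. The function name and the sample file names are hypothetical.

```python
def dominant_documents(doc_topics_text):
    """Map each topic index to (document_name, share_of_topic).

    share_of_topic is the top document's proportion divided by the sum
    of that topic's proportions over all documents.
    """
    rows = []
    for line in doc_topics_text.splitlines():
        # Skip blank lines and a possible header line starting with '#'.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        name = fields[1]
        weights = [float(value) for value in fields[2:]]
        rows.append((name, weights))
    if not rows:
        return {}

    result = {}
    for topic in range(len(rows[0][1])):
        total = sum(weights[topic] for _, weights in rows)
        name, weights = max(rows, key=lambda row: row[1][topic])
        result[topic] = (name, weights[topic] / total if total else 0.0)
    return result


if __name__ == "__main__":
    # Made-up numbers, only to show the expected layout.
    sample = (
        "0\tnight_mail.txt\t0.8\t0.2\n"
        "1\tanthem.txt\t0.1\t0.9\n"
        "2\tother.txt\t0.1\t0.5\n"
    )
    for topic, (name, share) in dominant_documents(sample).items():
        print(topic, name, round(share, 3))
```

A share near 1 means one work accounts for almost all of a topic, while a share near 1 divided by the number of documents means an even split. Because the proportions treat every document equally, this does not account for document length.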
I tried fewer topics this time, ten instead of fifteen.
The ten topics look very similar to the fifteen I had before. The two Gutenberg/copyright-related topics are still there, and many words related to men are still present.
And I get similar results with five topics as well.
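To put a number on how similar the topics are across the fifteen-, ten-, and five-topic runs, the top words of each topic can be compared directly. The sketch below assumes the topic-keys layout MALLET writes with `--output-topic-keys`: topic index, a weight, and a space-separated word list, separated by tabs. It uses Jaccard overlap (shared words divided by all words in either topic) as the similarity measure, which is my choice for illustration and not something MALLET reports. All function names are hypothetical.

```python
def read_topic_keys(text):
    """Parse topic-keys text into a list of word sets, one per topic."""
    topics = []
    for line in text.splitlines():
        fields = line.strip().split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        topics.append(set(fields[2].split()))
    return topics


def jaccard(a, b):
    """Shared words divided by all words appearing in either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def best_matches(run_a, run_b):
    """For each topic in run_a, find the most similar topic in run_b.

    Returns a list of (index_in_run_b, jaccard_score) tuples.
    """
    if not run_b:
        return []
    matches = []
    for words_a in run_a:
        scores = [jaccard(words_a, words_b) for words_b in run_b]
        best = max(range(len(scores)), key=scores.__getitem__)
        matches.append((best, scores[best]))
    return matches


if __name__ == "__main__":
    # Made-up topic keys, only to show the expected layout.
    keys_a = "0\t0.5\tman sir captain ship\n1\t0.5\tproject gutenberg copyright license\n"
    keys_b = "0\t0.5\tgutenberg project license works\n1\t0.5\tman captain sky night\n"
    print(best_matches(read_topic_keys(keys_a), read_topic_keys(keys_b)))
```

High scores for most topics would back up the impression that the smaller runs mostly reproduce the same topics, while low scores would point to topics that were merged or dropped.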