Session 7
Building text pipelines for signal extraction.
This session matters because many technology signals live in text, not just networks. It follows the data and network labs by introducing an NLP pipeline for technical corpora. Students learn preprocessing, keyword extraction, and the logic of entity recognition. They produce a keyword and entity summary that captures early thematic signals. Skills developed include text cleaning, feature extraction, and validation of automated outputs. This foundation prepares students for topic modeling and clustering in the next session.
Submission for this session
Previous session
Session 6: SNA Macro: Ecosystem Topology
Next session
Session 8: Topic Modeling and Clustering