Are the DPL platforms too long, and could you use a very, very short executive summary? No problem, I have the technology for it.
After the results you will find a kit for building your own extractor in the comfort of your home.
The results
- 93sam: jobs, deal, dds, nms, asking
- aigarius: applications, aigarius, choose, trademarks, apps
- ajt: humbug, effective, neat, hoping, success
- hertzog: broadly, wouter, serve, represent, pushed
- sho: deadline, helps, excellence, freaks, tasks
- sjr: qb, published, xxxx, r, yet
- stratus: stable, websites, feature, submitter, involving
- svenl: unfair, protest, ban, publish, banning
- wouter: controversy, seem, background, delegates, therefore
Acquiring the data
# Download each candidate's platform page into the current directory
for i in 93sam aigarius ajt hertzog sho sjr stratus svenl wouter
do
    wget "http://www.debian.org/vote/2007/platforms/$i"
done
Tokenizing
[Code listing (6 lines) missing from this copy.]
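Since the tokenizer itself is not shown here, this is a minimal sketch of what such a step could look like in Python. It is an assumption, not the original code: the names tokenize and count_tokens are made up for illustration, and the rule "a token is a run of ASCII letters, lowercased" is only a guess that happens to match the look of the results above. Note that the files fetched by wget are HTML, so tag names will show up as tokens unless the markup is stripped first.

```python
import re
from collections import Counter

# Assumption: a token is a run of ASCII letters, compared case-insensitively.
TOKEN_RE = re.compile(r"[a-z]+")


def tokenize(text):
    """Lowercase the text and return its runs of letters as a list of tokens."""
    return TOKEN_RE.findall(text.lower())


def count_tokens(path):
    """Read one downloaded platform file and count how often each token occurs."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return Counter(tokenize(fh.read()))
```

For example, tokenize("DDs & NMs") gives the two tokens dds and nms, and count_tokens("93sam") would count the tokens in the file that wget saved for that candidate.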
Extracting the most representative keywords
[Code listing (27 lines) missing from this copy.]
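The extraction code is not shown here either, so what follows is a sketch under a stated assumption: that "most representative" means something like TF-IDF, where a word scores high if it is frequent in one platform and rare in the others. The scoring method that actually produced the results above may well have been different. The function name top_keywords and its parameters are hypothetical.

```python
import math
from collections import Counter


def top_keywords(docs, n=5):
    """docs maps a name to its list of tokens.

    Returns a dict mapping each name to its n highest-scoring tokens,
    using a plain TF-IDF score (an assumed method, see the text above).
    """
    counts = {name: Counter(tokens) for name, tokens in docs.items()}

    # Document frequency: the number of documents each token appears in.
    df = Counter()
    for c in counts.values():
        df.update(c.keys())

    total = len(counts)
    result = {}
    for name, c in counts.items():
        length = sum(c.values()) or 1
        # Term frequency times inverse document frequency.
        scores = {
            tok: (freq / length) * math.log(total / df[tok])
            for tok, freq in c.items()
        }
        # Highest score first; ties are broken alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        result[name] = ranked[:n]
    return result
```

To use it, build docs with one entry per candidate, each holding the token list of that candidate's platform. A word that appears in every platform gets a score of zero, which is what keeps words like "debian" out of the summaries.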
Errata
Jacobo suggests using lynx -dump -nolist or w3m -dump for a more tokenizer-friendly text expansion.
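One way to wire that suggestion into a script is sketched below. It assumes lynx or w3m is installed and on the PATH, and the helper names dump_command and dump_text are hypothetical. Only the flags quoted above are used.

```python
import subprocess


def dump_command(url, tool="lynx"):
    """Build the command line that renders a page as plain text."""
    if tool == "lynx":
        return ["lynx", "-dump", "-nolist", url]
    if tool == "w3m":
        return ["w3m", "-dump", url]
    raise ValueError("unknown tool: " + tool)


def dump_text(url, tool="lynx"):
    """Run the chosen browser and return the rendered text of the page."""
    done = subprocess.run(
        dump_command(url, tool),
        check=True,           # raise if the browser exits with an error
        capture_output=True,  # collect stdout instead of printing it
        text=True,            # decode the output to str
    )
    return done.stdout
```

The text returned by dump_text can then go to the tokenizer in place of the raw HTML saved by wget.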