author | Cédric Bonhomme <kimble.mandel@gmail.com> | 2012-11-28 11:39:23 +0100 |
---|---|---|
committer | Cédric Bonhomme <kimble.mandel@gmail.com> | 2012-11-28 11:39:23 +0100 |
commit | 84a79ec06541c7db92af48b43d1d4d379cded730 (patch) | |
tree | acbaa6aa38153717d6cf360e519325e56f054492 /source/var/generate-top-words-list.sh | |
parent | Fix: number of feeds wan no longer displayed in the navigation bar. (diff) | |
Ignore stop words when calculating top words.
Diffstat (limited to 'source/var/generate-top-words-list.sh')
-rwxr-xr-x | source/var/generate-top-words-list.sh | 8 |
1 files changed, 8 insertions, 0 deletions
diff --git a/source/var/generate-top-words-list.sh b/source/var/generate-top-words-list.sh
new file mode 100755
index 00000000..2a87e147
--- /dev/null
+++ b/source/var/generate-top-words-list.sh
@@ -0,0 +1,8 @@
+#!/bin/sh
+
+if test $# != 2 ; then
+    echo No input files given 1>&2
+    exit 1
+fi
+
+awk 'BEGIN{FS = " "} { if ($1 ~ /^[A-Za-z]/) {print $1}}' $1 | sort | tr '\n' ';' > $2
\ No newline at end of file
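
To see what the added pipeline produces, it can be tried on a small sample. The following is a sketch and not part of the commit: the function name `generate_list`, the temporary directory, and the file names `words.txt` and `list.txt` are hypothetical, and the arguments are quoted here, which the committed script does not do.

```shell
#!/bin/sh
# Sketch only: reruns the commit's awk | sort | tr pipeline on sample data.
# generate_list and all paths below are hypothetical names for illustration.

generate_list() {
    # Keep the first field of each line when it starts with an ASCII letter,
    # sort the kept words, and join them with ';' into the output file.
    awk 'BEGIN{FS = " "} { if ($1 ~ /^[A-Za-z]/) {print $1}}' "$1" | sort | tr '\n' ';' > "$2"
}

tmpdir=$(mktemp -d)

# Sample input: one word per line, optionally followed by other fields.
printf 'the 120\n42 7\nand 95\n# comment\nof 80\n' > "$tmpdir/words.txt"

generate_list "$tmpdir/words.txt" "$tmpdir/list.txt"
result=$(cat "$tmpdir/list.txt")
echo "$result"   # prints: and;of;the;

rm -rf "$tmpdir"
```

Lines whose first field begins with a digit or punctuation (`42`, `#`) are dropped, and the output keeps a trailing `;` because `tr` converts the final newline as well.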