<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to 32: Fine tuning the output of KWE</title><link>https://sourceforge.net/p/lt4el/feedback-tracker/32/</link><description>Recent changes to 32: Fine tuning the output of KWE</description><atom:link href="https://sourceforge.net/p/lt4el/feedback-tracker/32/feed.rss" rel="self"/><language>en</language><lastBuildDate>Wed, 27 Jun 2007 13:14:08 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/lt4el/feedback-tracker/32/feed.rss" rel="self" type="application/rss+xml"/><item><title>Fine tuning the output of KWE</title><link>https://sourceforge.net/p/lt4el/feedback-tracker/32/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;I checked the KWE output for all languages wrt to patterns of unwanted keywords.&lt;br /&gt;
Here are some suggestions. Please, remove&lt;br /&gt;
0. All words which appear more than once&lt;br /&gt;
1. all strings of lower case character and or punctuation signs up to a length of three. Remark: I doubt that a good keyword has only three (lower case) characters or less. Upper case should be excluded here, cf. XP, XML etc. Any other counterexamples???&lt;br /&gt;
2. all strings which contain digits. Stricter version: strings which do include digits but do not end in one (in case we would like to include Win2000, Win98 etc.)&lt;br /&gt;
3. Strings which contain one of the following punctuation characters --&amp;gt; ., &amp;amp;, ,, /,(,),[,],%,*,&amp;lt;,&amp;gt;,%,+,:,", _ Remark: the following punctuation signs can be part of words and words containing them should be kept: §, ' (singlequote), - (dash)&lt;/p&gt;
&lt;p&gt;These three rules should do a decent job&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Lothar Lemnitzer</dc:creator><pubDate>Wed, 27 Jun 2007 13:14:08 -0000</pubDate><guid>https://sourceforge.netd5a2a4befeb5bb8e5f4f913c07ba3e01cf5a3169</guid></item></channel></rss>