<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en" xmlns="http://www.w3.org/2005/Atom"><title>Recent changes to 32: Fine tuning the output of KWE</title><link href="https://sourceforge.net/p/lt4el/feedback-tracker/32/" rel="alternate"/><link href="https://sourceforge.net/p/lt4el/feedback-tracker/32/feed.atom" rel="self"/><id>https://sourceforge.net/p/lt4el/feedback-tracker/32/</id><updated>2007-06-27T13:14:08Z</updated><subtitle>Recent changes to 32: Fine tuning the output of KWE</subtitle><entry><title>Fine tuning the output of KWE</title><link href="https://sourceforge.net/p/lt4el/feedback-tracker/32/" rel="alternate"/><published>2007-06-27T13:14:08Z</published><updated>2007-06-27T13:14:08Z</updated><author><name>Lothar Lemnitzer</name><uri>https://sourceforge.net/u/userid-1604795/</uri></author><id>https://sourceforge.netd5a2a4befeb5bb8e5f4f913c07ba3e01cf5a3169</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;I checked the KWE output for all languages wrt to patterns of unwanted keywords.&lt;br /&gt;
Here are some suggestions. Please, remove&lt;br /&gt;
0. All words which appear more than once&lt;br /&gt;
1. all strings of lower case character and or punctuation signs up to a length of three. Remark: I doubt that a good keyword has only three (lower case) characters or less. Upper case should be excluded here, cf. XP, XML etc. Any other counterexamples???&lt;br /&gt;
2. all strings which contain digits. Stricter version: strings which do include digits but do not end in one (in case we would like to include Win2000, Win98 etc.)&lt;br /&gt;
3. Strings which contain one of the following punctuation characters --&amp;gt; ., &amp;amp;, ,, /,(,),[,],%,*,&amp;lt;,&amp;gt;,%,+,:,", _ Remark: the following punctuation signs can be part of words and words containing them should be kept: §, ' (singlequote), - (dash)&lt;/p&gt;
&lt;p&gt;These three rules should do a decent job&lt;/p&gt;&lt;/div&gt;</summary></entry></feed>