Difference between revisions of "Stopwords for Greek and Latin"

From The Digital Classicist Wiki
Jump to navigation Jump to search
(Added categories (FAQ, Tools))
m
Line 1: Line 1:
 
== Status quaestionis ==
 
== Status quaestionis ==
  
stop word, n. A very common word that is generally uninteresting to search for (a [http://xtf.wiki.sourceforge.net/underHood_Documents#StopWords| XTF Definition]).
+
'''stop word''', n. A very common word that is generally uninteresting to search for (a [http://xtf.wiki.sourceforge.net/underHood_Documents#StopWords| XTF Definition]).
  
 
If you are not a linguist with a special interest in words like Latin "cum" or Greek "kai", or if you have a large collection of Greek or Latin texts and want to make searches in these collection more efficient, or if you have to prepare an index to such a collection (based on concordances of such collection), it is useful to have a list of stop words handy.  Of course, such "uninteresting" words will not be excluded from your search results (thanks to so called "bigramming", q. v. on the [http://xtf.wiki.sourceforge.net/underHood_Documents#StopWords|XTF Definition link]). Also, you can have both, providing to users of your collections searches with filtered stop words and without such filter (as it is done in [http://www.lib.uchicago.edu/efts/PERSEUS/latin.html#| Perseus under PhiloLogic]).
 
If you are not a linguist with a special interest in words like Latin "cum" or Greek "kai", or if you have a large collection of Greek or Latin texts and want to make searches in these collection more efficient, or if you have to prepare an index to such a collection (based on concordances of such collection), it is useful to have a list of stop words handy.  Of course, such "uninteresting" words will not be excluded from your search results (thanks to so called "bigramming", q. v. on the [http://xtf.wiki.sourceforge.net/underHood_Documents#StopWords|XTF Definition link]). Also, you can have both, providing to users of your collections searches with filtered stop words and without such filter (as it is done in [http://www.lib.uchicago.edu/efts/PERSEUS/latin.html#| Perseus under PhiloLogic]).

Revision as of 12:09, 26 August 2008

Status quaestionis

stop word, n. A very common word that is generally uninteresting to search for (a XTF Definition).

If you are not a linguist with a special interest in words like Latin "cum" or Greek "kai", or if you have a large collection of Greek or Latin texts and want to make searches in these collection more efficient, or if you have to prepare an index to such a collection (based on concordances of such collection), it is useful to have a list of stop words handy. Of course, such "uninteresting" words will not be excluded from your search results (thanks to so called "bigramming", q. v. on the Definition link). Also, you can have both, providing to users of your collections searches with filtered stop words and without such filter (as it is done in Perseus under PhiloLogic).

However, at the moment there are no stop word lists freely available for Greek or Latin; it seems that people compile them when they need them (and if they have the time), thereby doing the same all over again, instead of possibly improving on what others already did.

Bibliography

The tag LatinWordStopList on bibsonomy provides a working bibliography of bookmarks and publications on word frequency in Latin.