Simple terms and phrases
When performing a search one can use simple terms, that is, single words or phrases, that is, a group of words surrounded by double quotes, for example “Pawiak prison”. The use of quotation marks will result in finding only those documents that contain the whole phrase.
Search terms can be combined by Boolean operators. One can use wildcard characters to replace letters, numbers and the series of letters and numbers, search terms similar in spelling, search terms that are a distance away from each other and determine the relevance of particular search terms.
Boolean operators
Wildcard characters
Fuzzy Search
Fuzzy search is used to find simple terms that are similar in spelling – for example, Holocaust, Holokaust. The documents that contain these terms can be found by adding tilde symbol to the term holocaust: holocaust~.
A similarity degree can be determined by a similarity coefficient ranging from 0 (no similarity) to 1 (identical terms). A similarity coefficient is by default set to 0.5. In order to change it, one must add the tilde symbol and a clearly specified coefficient to the search term – for example, holocaust~0.4.
Proximity search
It is also possible to specify search terms that are a distance away from each other (the so-called proximity search). If one remembers that a document contains two words a short distance away, for example Gestapo and torture, one can use the following search query: ‘Gestapo torture ~6’.
Term relevance specification
It is possible to specify a term relevance by adding a symbol ^ along with a number (greater than 1). For example, a search query Lange^4 Sajnóg will result in finding documents that contain both names, but the name labelled as more relevant will top the search list (Lange). Relevance is by default set to 1.
Query Grouping
Phrases used in complex queries can be grouped by means of parentheses. This allows extensive queries to acquire their unequivocal meaning, as is the case with arithmetic operations.
The query “obóz zagłady w Treblince” AND (Holokaust OR Holocaust) will result in finding documents that contain a phrase “obóz zagłady w Treblince” and one of the two words (or both): Holocaust, Holocaust.
Special characters
For obvious reasons in the search process characters used to create complex queries (+ – && || ! ( ) { } [ ] ^ ” ~ * ? : ) have a different role to fulfill from all the other characters. These special characters are part of the query syntax and not part of the search term. In order to include them in the search, one needs to use the so-called escape character . For example, in order to find a phrase “(2+2)*2” use the query “( 2 + 2 ) * 2“.
Source
Full instruction on making queries: Apache Lucene Query Parser Syntax.
The text was originally published on the website of Kujawsko-Pomorska Digital Library.
Content is available under Creative Commons Attribution – Share Alike 2.5 Poland License.
The library of the Pilecki Institute
ul. Stawki 2, 00-193 Warszawa
Monday to Friday, 9:00 - 15:00
(+48) 22 182 24 75
The library of the Berlin branch of the Pilecki Institute
Pariser Platz 4a, 00-123 Berlin
Pon. - Pt. 10:30 - 17:30
(+49) 30 275 78 955
This page uses 'cookies'. More information
Ever since it was established, the Witold Pilecki Institute of Solidarity and Valor has been collecting and sharing documents that present the multiple historical facets of the last century. Many of them were previously split up, lost, or forgotten. Some were held in archives on other continents. To facilitate research, we have created an innovative digital archive that enables easy access to the source material. We are striving to gather as many archives as possible in one place. As a result, it takes little more than a few clicks to learn about the history of Poland and its citizens in the 20th century.
The Institute’s website contains a description of the collections available in the reading room as well as the necessary information to plan a visit. The documents themselves are only available in the Institute’s reading room, a public space where material is available free of charge to researchers and anyone interested in the topics collected there. The reading room also offers a friendly environment for quiet work.
The materials are obtained from institutions, public archives, both domestic and international social organizations, as well as from private individuals. The collections are constantly being expanded. A full-text search engine that searches both the content of the documents and their metadata allows the user to reach the desired source with ease. Another way to navigate the accumulated resources is to search according to the archival institutions from which they originate and which contain hierarchically arranged fonds and files.
Most of the archival materials are in open access on computers in the reading room. Some of our collections, e.g. from the Bundesarchiv, are subject to the restrictions on availability resulting from agreements between the Institute and the institutions which transfer them. An appropriate declaration must be signed upon arrival at the reading room in order to gain immediate access to these documents.
Before your visit, we recommend familiarizing yourself with the scope and structure of our archival, library and audio-visual resources, as well as with the regulations for visiting and using the collections.
All those wishing to access our collections are invited to the Pilecki Institute at ul. Stawki 2 in Warsaw. The reading room is open from 9–15, Monday to Friday. An appointment must be made in advance by emailing czytelnia@instytutpileckiego.pl or calling (+48) 22 182 24 75.
Please read the privacy policy. Using the website is a declaration of an acceptance of its terms.