Boolean Searching on the Internet


A PRIMER IN BOOLEAN LOGIC


The Internet is a vast computer database. As such, its contents must be searched according to the rules of computer database searching. Much database searching is based on the principles of Boolean logic. Boolean logic refers to the logical relationship among search terms, and is named for the British mathematician George Boole.

On Internet search engines, the options to construct logical relationships among search terms extend beyond the traditional practice of Boolean searching. This will be covered in the section below, Boolean Searching on the Internet.

Boolean logic consists of three logical operators:
Each operator can be visually described by using Venn diagrams, as shown below.
OR
Venn diagram for OR

college OR university

Query:    I would like information about college.

OR logic is most commonly used to search for synonymous terms or concepts.

Here is an example of how OR logic works:

Search terms Results
college 17,320,770
university 33,685,205
college OR university 33,702,660

OR logic collates the results to retrieve all the unique records containing one term, the other, or both.

The more terms or concepts we combine in a search with OR logic, the more records we will retrieve.

Venn diagram for OR
For example:

Search terms Results
college 17,320,770
university 33,685,205
college OR university 33,702,660
college OR university OR campus 33,703,082

AND
Venn diagram for AND

poverty AND crime

Query:    I'm interested in the relationship between poverty and crime.

Here is an example of how AND logic works:

Search terms Results
poverty 783,447
crime 2,962,165
poverty AND crime 1,677

The more terms or concepts we combine in a search with AND logic, the fewer records we will retrieve.

Venn diagram for AND
For example:
Search terms Results
poverty 783,447
crime 2,962,165
poverty AND crime 1,677
poverty AND crime AND gender 76

A few Internet search engines make use of the proximity operator NEAR. A proximity operator determines the closeness of terms within a source document. NEAR is a restrictive AND. The closeness of the search terms is determined by the particular search engine. For example, NEAR in AltaVista (Power Search) is 10 words. As another example, Google defaults to proximity searching by default.
NOT
Venn diagram for NOT

cats NOT dogs

Query:    I want to see information about cats, but I want to avoid seeing anything about dogs.

Here is an example of how NOT logic works:

Search terms Results
cats 3,651,252
dogs 4,556,515
cats NOT dogs 81,497

NOT logic excludes records from your search results. Be careful when you use NOT: the term you do want may be present in an important way in documents that also contain the word you wish to avoid.

BOOLEAN SEARCHING ON THE INTERNET


When you use an Internet search engine, the use of Boolean logic may be manifested in three distinct ways:

  1. Full Boolean logic with the use of the logical operators
  2. Implied Boolean logic with keyword searching
  3. Predetermined language in a user fill-in template

1. Full Boolean logic with the use of the logical operators

Many search engines offer the option to do full Boolean searching requiring the use of the Boolean logical operators.

Examples:

Query:    I need information about cats.

Boolean logic:    OR

Search:    cats OR felines

Query:    I'm interested in dyslexia in adults.

Boolean logic:    AND

Search:    dyslexia AND adults

Query:    I'm interested in radiation, but not nuclear radiation.

Boolean logic:    NOT

Search:    radiation NOT nuclear
Query:    I want to learn about cat behavior.

Boolean logic:    OR, AND

Search:    (cats OR felines) AND behavior

Note: Use of parentheses in this search is known as forcing the order of processing. In this case, we surround the OR words with parentheses so that the search engine will first process this part of the search. Next, the search engine with combine this result with the last part of the search. Using this method, we are assured that the OR terms are kept together as a logical unit.

2. Implied Boolean logic with keyword searching

Keyword searching refers to a search type in which you enter terms representing the concepts you wish to retrieve. Boolean operators are not used.

Implied Boolean logic refers to a search in which symbols are used to represent Boolean logical operators. In this type of search on the Internet, the absence of a symbol is also significant, as the space between keywords defaults to either OR logic or AND logic. Many well-known search engines traditionally defaulted to OR logic, but as a rule are moving away from the practice and defaulting to AND.

Implied Boolean logic has become so common in Web searching that it may be considered a de facto standard.

Examples:

Query:    I need information about cats.

Boolean logic:    OR

Search:    cats    felines

This example holds true for the search engines that interpret the space between keywords as the Boolean OR. To find out which logic the engine is using as the default, consult the help files at the site.
Query:    I'm interested in dyslexia in adults.

Boolean logic:    AND

Search:    +dyslexia    +adults

Query:    I'm interested in radiation, but not nuclear radiation.

Boolean logic:    NOT

Search:    radiation    -nuclear
Query:    I want to learn about cat behavior.

Boolean logic:    OR, AND

Search:    cats    felines    +behavior

3. Predetermined language in a user fill-in template

Some search engines offer a search template which allows the user to choose the Boolean operator from a menu. Often the logical operator is expressed with substitute language rather than with the operator itself.

Examples:

Query:    I need information about cats

Boolean logic:    OR

Search:    Any of these words/Can contain the words/Should contain the words

Query:    I'm interested in dyslexia in adults.
Boolean logic:    AND

Search:    All of these words/Must contain the words

Query:    I'm interested in radiation, but not nuclear radiation.

Boolean logic:    NOT

Search:    Must not contain the words/Should not contain the words

Query:    I want to learn about cat behavior.

Boolean logic:    OR, AND

Search:    Combine options as above if the template allows multiple search statements

Quick Comparison Chart:
Full Boolean vs. Implied Boolean vs. Templates

  Full Boolean Implied Boolean Template Terminology
OR college or university college    university
*see note below
any of these words
can contain the words
should contain the words
AND poverty and crime +poverty    +crime all of these words
must contain the words
NOT cats not dogs cats    -dogs must not contain the words
should not contain the words
NEAR, etc. cats near dogs N/A near

* This search statement will resolve to AND logic at search engines that use AND as the default. Examples include AOL.COM Search, Google and Lycos. Be sure to consult the Help files at each engine site find out which logic is the default.

Where to Search:
A Selected List
Feature Search Engine
Boolean operators AltaVista Advanced Search | Dogpile | HotBot | Ixquick Metasearch | ProFusion
Full Boolean logic with parentheses, e.g.,
behavior and (cats or felines)
AltaVista Advanced Search | HotBot | Ixquick Metasearch | MSN Search Advanced Search
Implied Boolean +/- Most search engines offer this option
Boolean logic
by template terminology
AllTheWeb Advanced Search | AOL.COM Search Options | Google Advanced Search | HotBot | Lycos Pro | MSN Search Advanced Search | ProFusion Advanced | Snoopa Advanced Search
Proximity operators AltaVista Advanced Search | Google [by default] | Ixquick Metasearch