Skip to Main Content
University of York Library
Library Subject Guides

Searching for information: a practical guide

Advanced search techniques

Searching for information is not always straightforward. We got a bunch of librarians to suggest some insider pointers and useful techniques.
Feedback
X

We've seen how Google and Google Scholar can use certain special characters or terms in their searches — what are called search operators. Some of these operators, along with a range of other operators, can also be found in most bibliographic databases and many other academic resources including YorSearch. For most of this page we'll be taking a look at some of the most common and most useful operators you could use, along with some other useful advanced searching features.

But first, let's have a reminder of how to perform an advanced search in Google Scholar...

Google Scholar's "Advanced search"

Under the hamburger menu () in Google Scholar you'll find Advanced search which gives you a lot more control over your search.


Let's take a look at each of the options:

With all of the words

This is effectively the default search mode in Google Scholar: an article must contain all of the words in this box. A search for dogs cats mice would find anything that mentioned all three animals somewhere in the text, but if an article only mentioned dogs and cats (no mice) it shouldn't be included in the results.


dogs cats mice

This style of searching is what's called implicit AND because it's searching for articles that contain dogs and cats and mice.

With the exact phrase

In the previous search, our terms could appear anywhere in the document. With this one they have to appear as written. A search in this box for ah good the sea would return only one of the following snippets (all of which would be returned by the previous search):

ah good ..ah good / God knows ah good / please recognize me.! / log of the sea tells it.. / log of the land tells it..

- Clarke, L. (c.1974). Where Hurricane.

Are you in salt? We have been looking all over the place for human beings. Ah good the sea. Professor of non-existence, the body is evidence of the spirit. The natural key.

- Burroughs, W. (1976). It Belongs to the Cucumbers: On the Subject of Raudive’s Tape Voices.

In my opinion, a good bit of Emily Dickinson's poetry is the stanza "Rowing in Eden – / Ahthe Sea! / Might I but moor – / Tonight – / In thee!" from "Wild nights - Wild nights!" (1861).

- University of York (2022). Searching for information: a practical guide.

"ah good the sea"

The above is what you'll see in the search box once you've run the search. You can perform this kind of search without using the "Advanced search" form by just putting your exact phrase in "double quotes like this". This technique is called phrase searching and the double quotes are what we'd call an operator: something which when used in a search box performs a specific special task.

With at least one of the words

This box will return items that mention any of the entered words. A search in this box for whale dolphin orca narwhal will find items where at least one of those words appears:


whale OR dolphin OR orca OR narwhal

Once you've run the search, you'll see the above in your search box — your terms joined with the OR operator. The same operator works in a normal Google search, and in many other search engines and databases.

Use this technique to search for alternate ways of expressing a concept. For instance, a useful search might be orca OR "killer whale" OR orcas OR "killer whales".

Without the words

Documents that mention the words you put in this box will be excluded from your results. If you had leonardo in any of the first boxes and dicaprio in this box you will get matches for any document mentioning leonardo so long as it doesn't also mention dicaprio:


Leonardo DiCaprio image CC BY-SA Christopher William Adach.

leonardo -dicaprio

Once you've run the search, you'll get the above in the search box. Here the minus sign - in front of "dicaprio" is serving as what we might call a NOT operator: leonardo NOT dicaprio. Most databases will actually use the word NOT for this, rather than the minus sign, but the minus sign remains common in internet search engines like Google.

If you were searching for information about York but kept getting results about New York, you could do a search for york -"new york" and this will filter out any documents mentioning "New York". But be careful... maybe a really good article about York also mentions New York somewhere in it.

The other options

The rest of the options are a little more nuanced. In most cases they're ways in which you can limit your search...

Where my words occur

Google Scholar is often able to search the full text of an article, but that can mean you get an awful lot of results. You might therefore want to limit the target of your searching to just the title of an article. Article titles are usually very descriptive so this can be a good way of making sure the article has your search terms as its focus. But obviously there are risks to this approach: a smaller target is harder to hit, and not all titles will be as descriptive as you'd like.

Return articles authored by

Maybe you're interested in a specific author. You can use this box to search for authors by name, and by combining this field with other boxes on the 'Advanced search' screen you can even search within a particular author's body of work.

Return articles published in

It may be that you're only interested in articles published in a specific journal, or in a journal with particular words in the title. If that's the case then this is the box for you!

Return articles dated between

Sometimes you'll only be interested in articles from a specific period in time — perhaps you just want recent articles, or perhaps you're after articles from a particular point in the past. In these boxes you can set a date range for your search.

Going beyond Google Scholar

Most of the above examples can also be used in other databases too...

"Phrase searching"

Generally when we search, the search engine or database we're using will look for documents that match all of the terms we entered. But these terms could appear anywhere in the record. For instance, a search for university of york would find documents mentioning:

  • "...the University of York..."
  • "...the Faculty of Liberal Arts at York University, Ontario..."
  • "...the State University of New York..."
  • "...the School of the Arts at York St John University..."

All four results match our search terms, but only one of the results is an exact match for the terms in the arrangement they were written.

If we wanted to search only for cases where the words university, of, and york appear next to each other in that order we can make use of something called phrase searching... in other words, we can stick double quotes around our terms to make them into a single entity: "university of york" — with this search, the only one of those four results we're getting back is the first one.

Another example

Here's another example: a search for ah good the sea would return all three of the following snippets but a search for "ah good the sea" will only return one of them:

ah good ..ah good / God knows ah good / please recognize me.! / log of the sea tells it.. / log of the land tells it..

- Clarke, L. (c.1974). Where Hurricane.

Are you in salt? We have been looking all over the place for human beings. Ah good the sea. Professor of non-existence, the body is evidence of the spirit. The natural key.

- Burroughs, W. (1976). It Belongs to the Cucumbers: On the Subject of Raudive’s Tape Voices.

In my opinion, a good bit of Emily Dickinson's poetry is the stanza "Rowing in Eden – / Ahthe Sea! / Might I but moor – / Tonight – / In thee!" from "Wild nights - Wild nights!" (1861).

- University of York (2022). Searching for information: a practical guide.

Truncation and wildcards

iconMost databases allow you to enter elaborate controlled searches using special words and characters known as operators. Particularly powerful among these operators are wildcard characters. These work like jokers in a deck of playing cards, able to stand in for other content...

🃏🃏🃏

Truncation

Imagine you were searching for items relating to chewy sweets. Articles mentioning any of the following might be useful to your needs:

  • chew
  • chews
  • chewing [...gum]
  • chewy
  • Chewits

In most of the databases we subscribe to, you can use an asterisk (*) to stand for any potential word endings (as well as for the word itself). This method is called truncation.

Let's consider a search for chew*:

chew

chew

chews

chewing

chewy

chewits

chewed

chewable

chewbacca


There might be some examples there that we're not happy about, but on the whole it might be advantageous to take this approach rather than to try to incorporate every possible inflection of the word into our search.

Think carefully about where the asterisk goes. You need it in just the right place to capture the words you want and not too much that's irrelevant.

Placing the asteriskExercise

Let's take a look at what effect we'd get by placing the asterisk at different points in a word...

Use the arrow controls to move the asterisk to different points in the word below:

c
o
n
f
e
c
t
i
o
n
e
r
y
⬅️
➡️

When truncating, think about the different word endings you might expect to get. Not all words will truncate effectively; for instance, say you were searching for policy / policies...

polic* would find both policy and policies but it would also find police and policing. You might need to formulate your search in a different way.

Wildcards within words

If you wanted to search for yogurt there are some variant spellings to consider:

  • yogurt
  • yoghurt
  • yoghourt

In some databases you could use a wildcard character to stand in for none, one, or sometimes even more characters in a word.

The exact rules (and the wildcard characters used) vary from database to database, but here's an example of the sort of thing you might be able to achieve:

yogurt

...might retrieve...

yogurt

yoghurt

yoghourt


Some wildcards can also stand for a single interchangeable letter, for instance:

carameli?e

...will often retrieve...

caramelise

caramelize


Here's a table of some of the main database providers and their wildcards of choice:

Single letter replacementNo letter or one letterNo letter or any amount of lettersEnd-of-word truncation
EBSCOwom?ncolo#rdo*nuttruncat*
Ovidwom#ncolo?rtruncat*
ProQuest, Scopus & JSTORwom?ncolo*rdo*nuttruncat*
Web of Sciencewom?ncolo?rdo*nuttruncat*

Advanced search forms

Most bibliographic databases have some sort of advanced search option. Even YorSearch has an advanced search, and we've looked in detail at Google Scholar's advanced search interface.

Generally, an advanced search interface will look something like this:

Rather than just the one search box to play with, there'll be two or three, and maybe even the option to add more. Each box might be able to search different parts of the catalogue record, and you will be able to combine the different rows of your search in different ways to achieve different effects. The result is a lot more powerful than a single search box searching everything.

Scope and limits

You can usually set the scope of your search with a dropdown next to the search box. By default you will probably be searching all of the fields within a catalogue record, but you can limit to an individual field such as the title, the author or any keywords or subject headings. If you were looking for books by Charlotte Brontë rather than about Charlotte Brontë, you could limit your search to just search the Author field.

There may be additional limits you can apply to your search, such as date ranges. These might be on the advanced search form itself, or available to apply once you've got your results. We'll look more closely at these fliter controls in our sifting section.

Combining your search

Each row of your search can be combined using a logical operator: usually AND, OR, or NOT...

AND

Connect two rows of your search with AND to require a match on both rows.

OR

Connect two rows of your search with OR to have it match either of the rows.

NOT

Connect two rows of your search with NOT to match the first row and exclude any matches for the second row.

You can use these "Boolean operators" within the search box too. And you can even put bits of your search in brackets (like you would with mathematical formulae) to clear up any ambiguities when using multiple terms. For instance you might search for scones AND (jam OR cream) to find items that mention scones and jam or that mention scones and cream. Indeed, when you run your search using an advanced search form like the one above, you might find it gets translated into a search string with various operators and brackets peppered through it.

An advanced search form may lead you to creating very complicated searches like...

(("cream tea*" AND "clotted cream") OR ("cream tea*" NOT "whipped cream") OR (scone* AND jam AND "clotted cream") OR (scone* AND jam NOT "whipped cream")) AND (tea OR "earl grey" OR darjeeling OR assam OR "lapsang souchong")

The problem with a big search like this is that one misspelling can bring the whole thing down. It's therefore better to build up a search like this gradually.

Tip

Keep your searches as simple as possible to avoid getting tangled up. But don't be afraid to combine terms if you find it helpful. With an advanced search form it's best to treat each row as being separated by an AND, and maybe have more than one alternate term in each row by using OR.

Building a search using a search history

Some databases will keep a history of each search you conduct. Usually this is just retained for the session, but some databases will even let you create an account to save your searches to return to at a later date.

You can often use this search history to combine previous searches into a more complicated search.

In Scopus and Web of Science you can find your search history at the bottom of the "Advanced search" page and use the Combine controls there to merge searches. Here's a typical example, where rows 1 to 4 have been steadily combined over rows 5 to 7:

SearchTermsResultsTools
7
#5 AND #6
32,652   
6
#3 OR #4
259,970   
5
#1 OR #2
295,233   
4
kitten*
5,051   
3
cat OR cats
257,047   
2
puppie
1   
1
dog OR dogs
295,233   

This approach is handy for spotting any mistakes along the way, like line 2 where we misspelled "puppy" or "puppies". Sometimes a misspelling will still pick up results, which is quite fun! We're all fallible.

Building up a search like this also gives you a good sense of what every part of your search is doing, and the type of numbers it's getting.

Proximity operators

Proximity operators allow you to look for a word within a certain distance of another word.

Imagine you were searching for recipes for chocolate cake. You could perhaps use a search like...

"chocolate cake*" AND recipe

...but are there any problems with that approach?

It's a good search in terms of finding chocolate cake and chocolate cakes along with mentions of the word recipe, but it's possible that you'd get recipes that mention chocolate cake as an aside, while missing recipes that insert a word between chocolate and cake:

Chocolate icing recipe

This is a delicious icing that sets off any dessert. Ideal for eclairs, cupcakes, and chocolate cakes.

Ingredients:
300g icing sugar
100g unsalted butter

Chocolate sponge cake

A yummy cake recipe, perfect for all occasions!

Ingredients:
150g unsalted butter
150g caster sugar
100g self-raising flour
3 large eggs


We need a way of locating those results where chocolate cake is close to recipe, or chocolate is close to cake. That's where proximity operators come in...

Proximity operators usually sit between two search terms, and include a number which represents the maximum number of words permitted between the two terms. Even Google lets you use an asterisk (*) to stand for unknown words between terms.

Using proximity operators

Unfortunately, there isn't a universal standard for proximity operators: different databases use different language and syntax...

n words between (any order)n words between (first term first)
EBSCONnWn
JSTOR"term1 term2"~n
Ovid*ADJn
ProQuestNEAR/n
N/n
PRE/n
P/n
ScopusW/nPRE/n
web of ScienceNEAR/n

* In Ovid databases there are n-1 words between terms.


Think carefully about your choice of number. You're trying to find everything of relevance so think through some plausible sentences in your head to get an idea of a suitable value for n...

  • 0
    the York chocolate industry
  • 2
    chocolate production in York
  • 4
    York continued to produce quality chocolate
  • 6
    York achieved economic stability via its booming chocolate industries

The bigger the number, the further apart the two words can be.

Some databases also have options for searching within a single sentence or paragraph. Always check the help section of the database you're using to find out more information about how the operators work.

An example

In this example we'll use the operator NEAR/n

The search chocolate NEAR/4 cake* will find cases where there are 4 words or less between the word chocolate and the word cake*...

Numbers in the table indicate the number of words from the first matched word
0123456
✔ n=0chocolatecake
✔ n=1chocolateorangecake
✔ n=2chocolatemintmarblecakes
✔ n=2cake,wineandchocolate
✔ n=3cakedthickwithdarkchocolate
✔ n=4cakesthatarerichinchocolate
✘ n=5cakewassuperbandtherichchocolate

chocolate NEAR/4 cake*

Subject headings

Some databases will let you search directly against subject headings or keywords. These might be submitted by the author, or in some cases will be standardised subject headings imposed by the database cataloguers. Standardised headings use a controlled vocabulary that will be applied uniformly across articles on the same topic, irrespective of the terminology used in those articles. For example: articles about cats, kittens, and moggies might all be indexed under the uniform subject heading "felines".

Databases may also have a thesaurus feature which will map a search term (e.g. "cat") to the subject heading used (e.g. "felines").

catIn certain cases, subject headings may even be arranged in a hierarchy, for example "mammals" > "felines" > "cats, domestic", and may enable you to either search on a single level alone, or to explode at a point in that hierarchy so as to include all branches beneath.

Advanced literature searching

There's a lot of stuff to find. How can you find the things you really need? In this bitesized session, we show you some advanced literature searching tricks.

Forthcoming training sessions

Forthcoming sessions on :

Taught students
Staff
Researchers
Show details & booking for these sessions

There's more training events at:

Citation searching

Finding a single journal article can lead you to more texts on the subject. The author will have referenced other work to support their arguments, so looking at the references of a text you've found should lead you to similar material. 

Some databases will index these references within the article record, making them easier to locate. Some (e.g. Google Scholar, Scopus and Web of Science) will even link to later articles which have gone on to cite the text you've found.

The tree of cited and citing articles allows you to follow an academic argument through time

By tracing the cited and citing articles, you can follow an academic argument as it develops through time.

The number of times an article has been cited can act as an indicator of that article's perceived importance within a topic. Bear in mind that this may not be an indicator of its quality -- it may be a notorious paper for the wrong reasons! Also, bear in mind that the older a paper is, the more chance it's had to be cited. An article published this year is unlikely to have had chance to be cited.