We often use these two words interchangeably, and the word data is often used in a broad sense to refer to a collection of facts, numbers, and statistics collected together. In academic research the two are different and the difference is important in order to understand exactly what you are looking at and, crucially, what you can do with it.
Raw numbers collected as part of a study and stored.
Usually found in the form of a digital dataset - a collection of related sets of information/data kept as machine-readable files, that can be filtered and searched according to your own criteria, and can be analyzed using software such as Excel and SPSS.
Statistics are the result of some human analysis of the raw collected data. Data has been interrogated and processed in some way and decisions have been made on how to present that data to show a particular view of what is going on.
You will usually see statistics in tables, charts, or graphs, and also as numbers and percentages reported in articles.
Once a statistic is published it is static and only ever refers to that point in time.
Statistics can seem persuasive, but beware. They can often be used to make a weak argument seem stronger. Think critically about statistics that are presented to you, and decide whether you think they are strong enough as evidence to prove an argument. Think about what that stat really tells you and what it leaves out.
For more on how to read statistics with a critical eye, see the book:
"Damned lies and statistics: untangling numbers from the media, politicians, and activists", by Joel Best.
We subscribe to a number of statistical data sources, including international statistics, datasets, and market reports. Find out more via the links below:
There are many databases that specialise in making data available and searchable. Often these are produced by the people who gather the data, and the date included in each will depend on what is of interest to the organisation/s that have gathered it.
Some of the larger and more comprehensive databases for social and economic data are good places to start:
You may find useful statistics reported in journal articles. Someone else may have already done research in the area you're interested in. Scholars usually publish their findings in journal articles, including some of the data. You will also find stats in newspapers and magazines. Do be critical of your sources and follow them up though to be sure they are reputable and authoritative sources though.
The Library subscribes to many databases for searching for journal articles and other secondary sources. See your own Library Subject Guide for which databases to try when searching for journal articles.
Searching the web can be a minefield. Here sre some tips for searching on the web for statistics:
Add in words like data or statistics to your search terms
You can search particular sites or domains using advanced search functionality. For example, the site command in Google. A search for site:gov entered in with your keywords will only search sites with gov in the URL. See individual Help sections on Search Engines for more information.
YorSearch and Library catalogues tell you about particular titles a Library has in stock, some of which may be statistical in nature. The word statistics will be in the subject terms field, so you can use this word in a subject terms search in the advanced search.
You will probably be looking for data about certain people, activities or commodities.
If you're looking for data on people, you need to think about what 'social unit' you're interested in. You may be interested in individuals, couples, households or families. You may be after particular groups, like companies or political organisations. You may be interested in a nation. Particular groupings may also be important to your search — for example, race, nationality or gender.
If may be that you're interested in 'things' rather than people: for example, commodities like cars. Or you may be interested in a particular activity, like voting.
Defining exactly what is in scope and what is out of scope before you start searching will save you time later.
Most current statistics may actually be a year or more old as there are time lags whilst the information is collected, processed, and released.
Some sources of data offer huge banks of data that are gathered over years and you can interrogate these to do over-time comparisons.
Some sources just offer quick snapshots of a particular point in time.
Geography is usually a factor. There is usually a place that is the focus of your enquiry.
In order to find the right source to search for your data you need to consider the following:
Government departments - they collect data to aid them with policy decisions. This can be ministerial and non-ministerial departments.
Not-for-profit organisations - there are organisations who collect and publish styatistics to support their own agendas or aims. E.g. the World Health Organisation or the World Monetary Fund.
Commercial firms - for example marketing companies.
Academics - researchers and institutions gather and publish data as part of research projects. You can often find journal articles about topics that contain statistics as evidence. But you may be able to get access to whole datasets.
You may need to be a detective and do some Internet searching to determine who you think the main people are with a vested interest in your topic area. Work out if they publish data on their own websites.
If you are interested in particular countries it is a good idea to work out the government structure in order to work out who will likely publish stats on particular themes.
Flexibility and detective work are essential.
What you are looking for may never have been collected or may not be published for people to see. This is true of some countries where the data is just not made available. It may be that if it is very current information it is not available yet, or if it is older it may no longer be available to view. Can you refine your topic of investigation, changing either the geographical area, the timeframe or the other variables in some way?
Have you used the specialist tools and drawn a blank? Can you identify other organisations who may collect that data and explore their websites? Can you explore journal articles and other secondary sources instead?
Are you using the tool you're using to search effectively? Information databases and web search engines have help sections and tutorials with search tips.
It's one thing finding some data, but you probably need to interrogate it or manipulate it in some way...
Forthcoming sessions on :
There's more training events at: