Grading and Admissions Data for England (GRADE) user guide
Updated 24 January 2025
Applies to England
GRADE (Grading and Admissions Data for England) makes available linked micro-data from听Ofqual, the听Department for Education (DfE)听and the听听covering the period from 2017 to 2022. The GRADE data set includes detailed information on students鈥 progression from school to universities and colleges. It includes information on students鈥 qualifications, prior schooling, socio-economic background and their university and college applications and admissions.
This user guide introduces researchers to the GRADE听data sharing initiative and the GRADE data set. It provides an overview of the purpose and principles of making available linked data for independent research on the educational, assessment and admission systems in England. It also outlines the content of the GRADE data set and provides an overview of the data and linking procedures used for its production.
This user guide is published by Ofqual. It was updated in听January 2025听alongside the second release of the GRADE data set.听
Aim of GRADE
GRADE is a joint open data initiative by Ofqual, the Department for Education (DfE), and UCAS. The main aim is to make data available for independent research that serves the public good.听Table 1 gives an overview of the data owning organisations鈥 remit and reasons for sharing data.
罢丑别听initial aim of the GRADE听data sharing project was to allow external researchers to conduct independent high-quality evaluation of the judgements made in awarding grades in 2020. Now, GRADE aims to more broadly enable research to enhance the quality of the educational, assessment and university admission system in England and produce evidence to inform future education policy.
Table 1. Organisations鈥 remit and reasons for sharing data: an overview
Data owner | Status | Organisation鈥檚 remit | Reasons for sharing data |
---|---|---|---|
Ofqual | Non-ministerial government department with jurisdiction听听听over regulated qualifications provided in England | Regulate for the validity of qualifications, ensure听听fairness to learners in England and promote public confidence in the system | Facilitate the carrying out of programmes of research and听听听retrieving evidence for purposes in line with its remit |
Department for Education | UK government department | Responsible for education, children鈥檚 services, higher and听听听further education policy, apprenticeships, and wider skills in England, and听听听equalities | Promote research and analysis to provide guidance or听听听advice on education and/or well-being of children in England |
UCAS | Charity operating the application process for UK universities | Provide evidence into higher education access and outcomes (Higher Education Research Act 2017, Section 79) | Promote more comprehensive statistical analysis to allow听听听the performance of tasks carried out in the public interest and support听听听efforts to promote research under the Digital Economy Act |
The GRADE data set
GRADE makes available a data set of linked micro-data from Ofqual, DfE and UCAS covering the period between 2017 to 2022. The GRADE data set includes detailed information on students鈥 progression from school to universities and colleges. It includes information on students鈥 qualifications, prior schooling, socio-economic background and their university and college applications and admissions.
Research potential of the GRADE data set
The GRADE data set can be used for research on the educational, assessment and higher education admission systems in England. It provides opportunity to investigate linked data on students鈥 granular attainment at GCSE and A level, their prior attainment, detailed socio-demographic characteristics and their applications to university and college. Covering alternative arrangements for grading in 2020 and 2021, it also provides a unique opportunity to compare and evaluate assessment methods.听
Using the GRADE data set, researchers can for instance investigate questions around:
- equality and the effects of socio-demographic factors on education and university admission
- educational trajectories and the factors influencing students鈥 choices at A level and applications to university
- fairness of assessment
- longitudinal trends in the educational, assessment and admission systems, particularly the effects of the Covid-19 pandemic
罢丑别听GRADE research output repository听provides researchers an up-to-date list of published research and analysis using the GRADE data set.
Structure of the GRADE data set
The GRADE data set includes 3 sources of administrative data. Each data source includes multiple data tables.
- Ofqual dat补听鈥 data on GCSE and A level qualifications and examinations collected from awarding organisations
- awarding tables
- grade boundaries table
-
DfE dat补听鈥 extracts of the National Pupil Database (NPD) for key stage 4 and key stage 5 students
- exam results tables
- student tables
- prior attainment tables
- census tables
-
UCAS dat补听鈥 data from the university and college application and admission process
- apply qualifications tables
- applications tables
- applicant tables
The next section on the听Content of the GRADE data set听provides an overview of the information contained in the 3 data sources and each table. The data tables, within and across data sources, are linked together by common identifiers, see section on听Data linking.听More detail on the variables included in each data table of each dataset is provided in the听data specifications.
Content of the GRADE data set
This section presents an overview of the information contained in the GRADE data set. More detail is provided in the听data specifications.
The GRADE data set as a whole covers the period from 2017 to 2022. It covers six cohorts of students, including those who were awarded GCSE and A level grades in 2020 and 2021 when schools were closed, and exams were cancelled because of the Covid 19 pandemic.听
Ofqual data
Ofqual data includes information on GCSE and A level qualifications and examinations taken by students of all ages in England.听
The awarding tables include detailed information on the qualifications taken (such as subject and tier of entry) and on students鈥 attainment. The grade boundaries table can be linked to the awarding table to help with interpreting marks and grades.
The awarding tables and grade boundaries table are based on听Summer awarding data. Summer awarding data is provided to Ofqual by awarding organisations prior to grades being issued to centres and students in August each year. The grades and qualifications included in the summer awarding data are provisional. The awarding tables included in the GRADE data set enrich the summer awarding data with: information on final grades awarded after all types of reviews concluded, an indicator of Key Stage 2 attainment for GCSE students and indicators of GCSE attainment for A level students, and demographic information about students, such as gender and age. The grade boundaries table can be linked to the awarding table of the corresponding year and qualification level using variables SpecCodeNoP and CertCodeNoP for linkage. The grade boundaries table is only available for 2022.
For the periods 2017 to 2019 and 2022 when exams were taken normally, the awarding tables include qualification grades and marks.听听In 2020 and 2021, exams were cancelled and alternative approaches to grading were taken. For 2020, the awarding tables include Centre Assessment Grades and calculated grades from the听, which were used to determine the awarded grade. For 2021, the awarding tables include听. For summer 2022, exams and other formal听assessments went ahead with some planned adaptations听intended to recognise the disruption to education caused by the COVID-19 pandemic, and grading was at a midpoint between summer 2021 and 2019.
Department for Education data
Data from the Department for Education is based on the听. The NPD is compiled by the Department for Education (DfE) from data supplied by schools, local authorities, centres and awarding organisations. The NPD constitutes the main source of information for the computation of accountability measures and is widely used for research purposes.
There are 4 main extracts of the NPD shared as part of GRADE. The NPD exam results tables contain student-level results data by qualification taken for Key Stages 4 and 5, covering A levels, GCSE and other general and vocational or technical qualifications, as provided by awarding organisations. The exam results tables include aggregated achievement indicators, such as those related to EBacc, Attainment 8 and Progress 8.
The NPD student tables contain data on students, including demographic and protected characteristics, such as gender, age, language spoken, free school meal eligibility and special educational needs. Information on the centre attended by students is also available, covering the type of school or institution, its description, and the centre鈥檚 admissions policy.听
The census tables contain additional information on students, including further socio-demographic characteristics (such as ethnicity) and socio-economic indicators such as the Income Deprivation Affecting Children Index (IDACI).听
The prior attainment tables contain data on students鈥 prior attainment including achievement indicators, teacher assessments and test results for Key Stages 1 and 2.
UCAS data
UCAS data is based on the information gathered to operate the听. This comprises data submitted by applicants to the UCAS undergraduate scheme and by the HE institution receiving prospective students鈥 applications. UCAS data features three main data tables.
The applicant table contains information on applicants to the UCAS undergraduate scheme. For each applicant, this data set includes data on demographic characteristics (such as gender, age, geographical region) and socio-economic characteristics (such as ethnicity, socio-economic background, deprivation index).
The apply qualifications table is at qualification level and contains information on qualifications declared by the applicant during their application. This includes the A level grades predicted by teachers and submitted as part of an applicant鈥檚 application to higher education.
The applications table contains the data included in the application made by each applicant, the kind of offer made by each HE provider and the response from the applicant. It does not contain the specific offer letters for individual students. This allows researchers to have access to a wealth of data, including: the applications that did not receive an offer, if an offer was received which kind of offer (unconditional, conditional) was made, whether each offer was accepted as either firm or insurance.
Data coverage
The coverage of the 3 data sources is slightly different. Table 2 summarises the provenance and the coverage of each data source. Ofqual data is collected by qualifications, DfE data is collected by Key Stage and UCAS data is collected by applications to the undergraduate scheme. This means that Ofqual data includes students of any age, while DfE data only includes students in Key Stage 4 and 5. UCAS data includes all university and college applicants which is a subset of Key Stage 5 students (from DfE data) and A level students (from Ofqual data).听
Table 2: The provenance and coverage of each data source
Data source | Provenance | Population coverage | KS4 and GCSE included | KS5 and A level included |
---|---|---|---|---|
Ofqual | Summer awarding data as routinely collected on听听听qualifications achieved by students and other data collections submitted by听听听awarding organisations | GCSE and A levels achieved by learners in schools in听听听England between 2017 and 2022 | Yes | Yes |
DfE | National Pupil Database - Submission of data on students as听听听statutory requirement on schools, and exams results as submitted by exam听听听boards | School-aged learners in England | Yes | Yes |
UCAS | University application process - Data obtained as part of听听听UCAS鈥 position as the provider of a central admissions service for full-time听听听undergraduate courses for higher education providers within the UK | English 18-year-olds for the 2017 to 2022 application听听听cycles | No | Yes |
Data linking
Data from Ofqual, DfE and UCAS do not originally share any unique identifiers for students or schools. We produced such shared unique identifiers by linking data between the 3 data sources.
The linking procedure was based on the use of identifying information that is shared across the three data sources, including information about schools and students鈥 personal information (first name, middle name, surname, date of birth and gender). Students鈥 personal information was only used for data linking and will never be shared with external researchers.听
Several linking rounds were performed, starting from exact matches and resuming with some lower-quality matching. A variable indicating the quality of the matching will be available to researchers who will therefore have the option to rely on different levels of linking quality. As a result of the linking procedure, two anonymised identifiers, one for each student and one for each school, were generated. These identifiers are not used in any existing operational systems by any of the data owners
Data linking was done by Ofqual. Any questions about data linking can be addressed to听data.sharing@ofqual.gov.uk.
Data limitations
Data included in the GRADE data set was originally collected for non-statistical reasons, such as for the delivery of a public programme or service, the delivery of qualifications, or for maintaining school records. Research needs are not generally considered as part of the data collection design. As a result, the GRADE data set听
- may lack potentially useful information
- may contain errors and systematic inaccuracies related to the data collection procedures
GRADE includes administrative data that is also used for official statistics or in other research reports published by the data owners. Minor differences between the GRADE data set and those other published statistics are expected due to separate processing.
Researchers are not allowed to link the GRADE data set to additional data on individual students, schools, exam boards or HE providers that may be available from external sources.
Releases and updates to the GRADE data set
The GRADE data set was first released in 2021. This first release covered听the time period between听2017听and听2020. In 2024, the GRADE data set was re-released and updated to additionally cover 2021 and 2022.听
Data specifications of this听second release of the GRADE data set differs听slightly from the first release. In the current GRADE data set, all body corporate identifiers (such as names of exam boards, schools and universities) have been pseudonymised. This includes body corporate identifiers which had not been pseudonymised in the first release of GRADE 诲补迟补.听
Related data sets
- 听(NPD)
- Longitudinal Education Outcomes听(LEO)
- 听(GUiE)
Data governance: accessing the GRADE data set
Requesting access
The GRADE data set is shared by the听听under the听Digital Economy Act 2017听and can be accessed under the听听via the听听(SRS) and/or the听听(IDS).
To request access to the GRADE data set, researchers need to complete 2 steps:听
- Become an accredited researcher with the Office for National Statistics.
- Complete an application for the specific research project
Becoming an accredited researcher
The GRADE data set is only shared with听, who have demonstrated the skills and knowledge needed to use the data appropriately. Becoming an accredited researcher allows researchers to gain access to data stored in the听听(SRS) and/or the听听(IDS) and to carry out analyses in the SRS or IDS environment.
贵耻濒濒听听is available from the听. In brief, to be an accredited researcher, applicants must have an undergraduate degree (or higher) including a significant proportion of maths or statistics or be able to demonstrate at least 3 years of quantitative research experience. Successful completion of a 鈥楽afe Researcher鈥 training course is also required as part of the accreditation process.
Researchers can apply for accreditation through the听. Once granted, accredited researcher status is valid for 5 years, after which the researcher will need to apply again.
Applying for a specific research project
To request access to the GRADE data set, researchers have to submit a research project application to the Office for National Statistics听. The research project application includes a list of all people involved in the research, a detailed research proposal and a check for ethical approval of the research.听
Researchers should apply only for the data they need for their research project. In their application, they should select the relevant data sources, which may be:
- Ofqual-only
- Ofqual and DfE
- Ofqual and UCAS
- Ofqual, DfE and UCAS
DfE-only data and UCAS-only data cannot be requested as part of GRADE.
In the project application form, researchers must state how and for which purpose each data source will be used. Researchers should clearly state the connection between the aim of their research project, the听Aim of GRADE听and the听data owner鈥檚 remit. This constitutes a key element for the accreditation of the project.
Researchers are advised to use the听. They can also refer to an听, which has a similar structure and format to GRADE research project applications.听
Researchers who are yet to become accredited researchers can still submit a research project application, but they will have to be accredited before being able to access the GRADE data set (as described in the section on听becoming an accredited researcher.听
Ethical considerations
The GRADE data set can only be used for research that is legal, ethical and feasible. Research projects must demonstrate a clear public benefit. Research outcomes must be published. Detailed criteria can be found on the听UKSA website.听
The GRADE application review process
The application review process has 3 stages:听
- Stage 1.听Applications are received and checked by the Office for National Statistics听.
- Stage 2.听Applications that pass the initial check, and feasibility assessment are reviewed by Ofqual and the Department for Education as data owners. UCAS chose to be informed about applications but not to conduct a data owner review.
- Stage 3.听Applications that are approved by the data owners are then scrutinised and accredited or rejected by the听
At each stage of the process, researchers can expect to receive feedback and requests to make minor or major revisions to their application.
Data security
To ensure the safe access and secure use of the data, the Office for National Statistics听听(SRS) and听听(IDS) use the听. The 5 safes are:
- Safe People听鈥 only trained and accredited researchers are trusted to use data appropriately.
- Safe Projects听鈥 data are only used for valuable, ethical research that delivers clear public benefits.
- Safe Settings听鈥 access to data is only possible using secure technology systems.
- Safe Outputs听鈥 all research outputs are checked to ensure they cannot identify data subjects.
- Safe Dat补听鈥 researchers can only use data that have been de-identified.
Safe People and Safe Projects: accreditation of researchers and projects
The first 2 Safes are achieved through the accreditation of researchers and the accreditation of projects (as detailed in听Data governance: accessing the GRADE data set).听
Safe settings: Office for National Statistics Secure Research Service (SRS) and Integrated Data Service (IDS)
External researchers access the data through the Office for National Statistics听听(SRS) and/or听听(IDS) which are Accredited Processors under the听.
The use of the SRS and IDS requires 补听听to access data. Once a project has been accredited, external researchers can access the GRADE data through Office for National Statistics听听or safe settings provided by approved Assured Organisational Connectivity organisations. The most popular route to use the SRS or IDS is under the听听which enables researchers from approved organisations to access projects directly from their organisational office or remotely from home. The organisation must achieve certification under the scheme which ensures that they meet the safe setting criteria by using secure technology and systems.
Safe outputs: Statistical Disclosure Control
A safe output is an output that is non-disclosive and maintains the confidentiality of the data subjects. All research outputs are checked by the SRS or IDS Statistical Support team to ensure that data subjects cannot be in any way identified. This involves the application of Statistical Disclosure Control methods to ensure that all outputs are non-disclosive. Any statistical output computed on fewer than 10 data subjects will not be disclosed and cannot be exported outside of the SRS or IDS. The SRS or IDS Statistical Support team provide guidance and checks until the publication of research outcomes.
Researchers working in the SRS and/or IDS are not allowed to copy or export extracts of the data. Researchers are not allowed to report or discuss their findings until their output is checked. This is to ensure that no information regarding the data can leave the system without being cleared by the SRS or IDS Statistical Support team.
Safe data: de-identification and pseudonymisation
External researchers will be only able to access data that has been de-identified and pseudonymised. Researchers are required not to identify individuals from the de-identified data and are reminded that it would be a criminal offence to do so. An Individual Declaration Form is required to be signed by each accredited researcher before access is given.听
Personal identifiers (including first name, surname, date of birth, school or college attended) are used for linking the data across data sources (Data linking) but will never be shared with researchers. Instead, meaningless identifiers created for each student and each school. A pseudonymisation algorithm was applied to both identifiers.听
Legislation
Legal gateways to data sharing and processing
Ofqual, DfE and UCAS maintain ownership of their respective data within the GRADE dataset. Ofqual, DfE and UCAS act as joint controllers of the GRADE dataset.听
Ofqual may co-operate or work jointly with another public authority where it is appropriate to do so for the efficient and effective performance of its qualification functions under听. This encompasses the sharing of data.
DfE is able to onwardly share 鈥渋ndividual pupil information鈥 collected from schools through听听with 鈥淧rescribed Persons鈥. The Education (Individual Pupil Information) (Prescribed Persons) (England) Regulations 2009 go on to prescribe which persons may be provided with 鈥渋ndividual pupil information鈥 under section听, although this was subsequently amended and updated in the听.(ii) Legislation within section听听covers the sharing of Key Stage 4, Key Stage 5 for learners in FE colleges with named bodies and third parties as prescribed in听.
UCAS data sharing is in accordance with听, and related Data Protection Act provisions, for the purposes of legitimate interests pursued by the controller or by a third party where such interests are overridden by the interests or fundamental rights and freedoms of the data subject which require protection of personal data, in particular where the data subject is a child. This sharing actively supports efforts to meet government aims to encourage fairness and scrutiny of the higher education application process and promotes research in this field in accordance with the aims of the听.
罢丑别听听补濒濒辞飞蝉 the processing of personal data by an organisation where 鈥減rocessing is necessary for the performance of a task carried out in the public interest or in the exercise of official authority vested in the controller鈥 (Article 6(1)(e)) and states that special categories of data may be used for historical research or statistical purposes (Article 9(2)(j)).听
听补濒濒辞飞蝉 DfE and Ofqual to process personal data for their public task and听听补濒濒辞飞蝉 UCAS to process personal data for the purposes of legitimate interests. 听
In addition,听听补濒濒辞飞蝉 all organisations to process special category data where it is necessary for the purposes of research (such processing to be proportionate to the aims pursued and subject to safeguards).听 Special categories of data may be processed for research purposes when (i) this is in the public interest, (ii) the processing is not likely to cause substantial damage or distress to a data subject, and (iii) the processing is not carried out to take measures or make decisions with respect to a particular data subject.
础濒蝉辞,听听补濒濒辞飞蝉 the processing of special categories of data for reasons of substantial public interest to ensure equality of opportunity or treatment with regards to certain categories of data concerning specific groups of people.
Legal gateways to accessing data for research
The Five Safes Framework established under the听听enables governmental bodies to share administrative data for the purposes of research. Only researchers accredited by the UK Statistics Authority can have access to the data to conduct research projects also accredited by the UK Statistics Authority. To be shared under the Digital Economy Act, however, the data has to be completely de-identified or functionally anonymised.听
Any researcher who wants to access the data must be accredited (see听Becoming an accredited researcher) and the research project must have been approved (see听The GRADE application review process). In order for research projects to be approved they must comply with the听Research Code of Practice and Accreditation Criteria听which was approved by the UK Parliament in July 2018.
Data processing involved as part of this initiative is compliant with all applicable data protection legislation, including the听.
Data privacy
A key transparency requirement under the听听is that individuals have the right to be informed about the collection and use of their data. The intent to enable independent research by making data available to external researcher was听.
Each data owning organisation publishes a privacy notice on how it processes and protects personal information.听
- Ofqual鈥檚 personal data protection policy
- Department for Education鈥檚 NPD privacy notice
- Department for Education鈥檚 personal information charter
Given that this project is based on the linking of data sets from the different organisations which will broaden the set of information available for each student, all data owners also jointly issued:
- Privacy notice for the initial release of the GRADE data set
- Privacy notice for the second release of the GRADE data set with additional data for 2021 and 2022
The privacy notices provide students with information on why their data is shared, how their data is processed, the retention period for the data, and how the data will be shared.听
Resources
A range of useful resources are available to researchers interested in working with the GRADE data set.听
- data specifications, with the details (names and descriptions) of all the variables in each data table for all three data sources
- a low-fidelity synthetic GRADE data set which is available upon request听from Ofqual data.sharing@ofqual.gov.uk
- 补听research output repository, which provides an up-to-date list of the research and analysis conducted with the GRADE data set
Funding opportunities
Research projects using the GRADE data set should be independently funded. To promote the use of the GRADE data set, Ofqual, DfE and UCAS have partnered with听听to help researchers overcome funding barriers.
In 2021, ADR UK launched 补听听with funding for up to 12 months in duration and a maximum of 拢130,000.听
In 2024, GRADE was included in the听, with funding up to 18 months in duration and up to 80% of a maximum of 拢200,000 at full economic cost.听
Future funding opportunities will be advertised here if and when they become available.
Contact
For further information about accessing the GRADE data set and the application and accreditation process:
- Office for National Statistics the Secure Research Service (SRS)听srs.customer.support@ons.gov.uk
- Office for National Statistics Integrated Data Service (IDS)听ids.customer.support@ons.gov.uk
For further information about the GRADE data set, related resources or for research feasibility queries or feedback about the GRADE data set:
- 翱蹿辩耻补濒听data.sharing@ofqual.gov.uk
For queries specific to the individual data sources in the GRADE data set or for information about听organisational remit and data governance, please contact the data owner:
- 翱蹿辩耻补濒听data.sharing@ofqual.gov.uk
- Department for Education听data.sharing@education.gov.uk
- UCAS听stats@ucas.ac.uk
For queries regarding ADR UK funding opportunities:
- ADR UK 听hub@adruk.org