Green, Thomas Alexander Fleming ORCID: https://orcid.org/0000-0002-5643-2473 (2023) Using NLP to resolve mismatches between jobseekers and positions in recruitment. PhD thesis, University of Sheffield.
Abstract
Recruiting through online portals has seen a dramatic increase in recent decades and it is challenging for job seekers to evaluate the overwhelming amount of data to efficiently identify positions that align with their skills and qualifications. This research addresses this issue by investigating automatic approaches that leverage recent developments in Natural Language Processing (NLP) that search, parse, and evaluate the often unstructured data in order to find appropriate matches. We present the development of a benchmark suite consisting of an annotation schema, training corpus and baseline model for Entity Recognition (ER) in job descriptions, published under a Creative Commons licence. The dataset contains 18.6k entities comprising five types: Skill; Qualification; Experience; Occupation; and Domain. We develop a benchmark Conditional Random Fields (CRF) ER model which achieves an F1 score of 0.59, and our best performing model utilises Bidirectional Encoder Representations from Transformers (BERT) and achieves an F1 score of 0.73. We consider different ways of framing the matching problem and develop Machine Learning (ML) models to address each. We propose that the Natural Language Inference (NLI) paradigm most closely aligns with the matching problem. Our best performing model utilises decomposable attention and achieves an F1 score of 0.73 on a job application success prediction task. Finally, we integrate the ER and success prediction models into a cohesive pipeline that predicts whether a given job application made by a user will be successful, which can be extended into a system that recommends suitable jobs to a user. Although we observe poorer results on this pipeline relative to a more simple input truncation approach, we suggest this may be limited by the ER component for feature selection and the entity encoding process.
Metadata
Supervisors: | Diana, Maynard and Chenghua, Lin |
---|---|
Keywords: | nlp; recruitment; job recommendation; career; skills; ner; |
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Depositing User: | Dr Thomas Alexander Fleming Green |
Date Deposited: | 04 Apr 2024 09:41 |
Last Modified: | 04 Apr 2024 09:41 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:34220 |
Download
Final eThesis - complete (pdf)
Filename: phd_thesis.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.