Résumé parsing
Résumé parsing, also known as CV parsing, résumé extraction, or CV extraction, allows for the automated storage and analysis of résumé data. The résumé is imported into parsing software and the information is extracted so that it can be sorted and searched.
Description
Résumé parsers analyze a résumé, extract the desired information, and insert it into a database with a unique entry for each candidate. Once the résumé has been analyzed, a recruiter can search the database for keywords and phrases and get a list of relevant candidates. Many parsers support semantic search, which adds context to the search terms and tries to understand intent in order to make the results more reliable and comprehensive. The candidates returned are ranked based on how closely they match the keywords and job profile.
Machine learning
Machine learning is extremely important for résumé parsing. Each block of information needs to be given a label and sorted into the correct category, whether that is education, work history, or contact information. Rule-based parsers use a predefined set of rules to parse the text. This method does not work well for résumés because the parser needs to "understand the context in which words occur and the relationship between them." For example, if the word "Harvey" appears on a résumé, it could be the name of an applicant, refer to the college Harvey Mudd, or reference the company Harvey & Company LLC. The abbreviation MD could mean "Medical Doctor" or "Maryland". A rule-based parser would require incredibly complex rules to account for all the ambiguity and would still provide limited coverage.
This is where machine learning, and specifically natural language processing (NLP), comes in. NLP is a branch of artificial intelligence that uses machine learning to understand content and context as well as to make predictions. Many features of NLP are extremely important in résumé parsing. Acronym normalization and tagging accounts for the different possible formats of acronyms and normalizes them. Lemmatization reduces words to their root form using a language dictionary, while stemming strips suffixes such as “s” and “ing”. Entity extraction uses regular expressions, dictionaries, statistical analysis and complex pattern-based extraction to identify people, places, companies, phone numbers, email addresses, important phrases and more.
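As a rough illustration of three of the techniques named above (acronym normalization, stemming, and regular-expression entity extraction), the following minimal Python sketch may help. It is not taken from any particular parser: the acronym table, the patterns, and the function names normalize_acronym, naive_stem and extract_contacts are all assumptions made for this example, and the stemmer is a deliberately crude stand-in for a real one.

```python
import re

# Hypothetical acronym table; a production parser would use a much larger one.
ACRONYMS = {
    "BSC": "Bachelor of Science",
    "MBA": "Master of Business Administration",
}

EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Loose phone pattern: optional "+", then digits mixed with spaces, dots,
# hyphens, or parentheses. Real parsers handle many more formats.
PHONE_RE = re.compile(r"\+?\d[\d ().-]{7,}\d")


def normalize_acronym(token):
    """Map variants such as 'B.Sc.' or 'bsc' to one long form."""
    key = token.replace(".", "").upper()
    return ACRONYMS.get(key, token)


def naive_stem(word):
    """Strip a trailing 'ing' or 's'; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def extract_contacts(text):
    """Collect email addresses and phone numbers found in the text."""
    return {
        "emails": EMAIL_RE.findall(text),
        "phones": PHONE_RE.findall(text),
    }
```

Patterns like these handle well-formed items such as email addresses, but they cannot decide whether "Harvey" is a person, a college or a company, or whether "MD" is a degree or a state. That is the kind of ambiguity the statistical methods described above are meant to resolve.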
Effectiveness
Résumé parsers have achieved up to 87% accuracy, measured as how correctly the data is entered and categorized. Human accuracy is typically not greater than 96%, so the résumé parsers have achieved "near human accuracy."
One executive recruiting company tested three résumé parsers against humans to compare accuracy in data entry. They ran 1,000 résumés through the parsing software and had humans manually parse and enter the same data. The company brought in a third party to evaluate how the humans did compared to the software. They found that the results from the résumé parsers were more comprehensive and had fewer mistakes. The humans did not enter all the information on the résumés and occasionally misspelled words or wrote incorrect numbers.
In a 2012 experiment, a résumé for an ideal candidate was created based on the job description for a clinical scientist position. After going through the parser, one of the candidate's work experiences was completely lost because the date was listed before the employer. The parser also failed to catch several educational degrees. As a result, the candidate received a relevance ranking of only 43%. If this had been a real candidate's résumé, they would not have moved on to the next step even though they were qualified for the position. It would be helpful if a similar study were conducted on current résumé parsers to see whether there have been any improvements over the past few years.
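The experiment above does not say how the relevance ranking was computed. As a hedged illustration only, the Python sketch below assumes the simplest possible scheme: the share of job keywords found in the sections the parser managed to extract. The function relevance, the keyword list and the sample text are all invented for this example. It shows how losing a single work-experience block can sharply lower a score.

```python
import re


def relevance(parsed_sections, keywords):
    """Return the share of job keywords found in the parsed sections (0.0 to 1.0)."""
    text = " ".join(parsed_sections).lower()
    words = set(re.findall(r"[a-z0-9]+", text))
    if not keywords:
        return 0.0
    hits = sum(1 for k in keywords if k.lower() in words)
    return hits / len(keywords)


# Made-up job keywords and résumé sections, for illustration only.
keywords = ["PCR", "ELISA", "HPLC", "GLP"]
full = [
    "Clinical scientist running PCR and ELISA assays",
    "HPLC method validation under GLP",
]
lost = full[:1]  # the second work experience was dropped by the parser

score_full = relevance(full, keywords)
score_lost = relevance(lost, keywords)
```

In this toy case the score falls from 1.0 to 0.5 once the second section is lost, even though the candidate's actual qualifications are unchanged.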
Benefits
- A famous study conducted by Marianne Bertrand and Sendhil Mullainathan in 2003 looked at whether candidates with the names Emily and Greg were more employable than Lakisha and Jamal. The conclusion was that résumés with white-sounding names received 50% more callbacks than ones with black-sounding names. In 2014, a study was done in Australia and New Zealand to investigate name discrimination based on gender. Insync Surveys, a research firm, and Hays, a recruitment specialist, sent a résumé to 1,029 hiring managers with the name being the only difference. Half the hiring managers received a résumé for Simon Cook and the other half received one for Susan Campbell. The study found that Simon was more likely to get a callback. Résumé parsing allows candidates to be ranked based on objective information and can help prevent the bias that so easily shows up in the hiring process. The software can be programmed to ignore and hide factors that contribute to bias, such as name, gender, race, age, address and more.
- The technology is extremely cost-effective and saves resources. Rather than asking candidates to enter their information manually, which could discourage them from applying, or spending recruiters' time on it, data entry is now done automatically.
- The contact information, relevant skills, work history, educational background and more specific information about the candidate are easily accessible.
- The applicant screening process is now significantly faster and more efficient. Instead of having to look at every résumé, recruiters can filter them by specific characteristics, sort and search them. This allows recruiters to move through the interview process and fill positions at a faster rate.
- One of the biggest complaints among job seekers is the length of the application process. With résumé parsers, the process is now faster and candidates have an improved experience.
- The technology helps prevent qualified candidates from slipping through the cracks. On average, a recruiter spends six seconds looking at a résumé. When a recruiter is looking through hundreds or thousands of them, it can be easy to miss or lose track of potential candidates.
- Once a candidate's résumé has been analyzed, their information remains in the database. If a position comes up that they are qualified for but have not applied to, the company still has their information and can reach out to them.
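The first benefit above mentions that software can be programmed to ignore and hide factors that contribute to bias. A minimal Python sketch of that idea follows, assuming the parsed résumé is held as a dictionary. The field names and the redact function are hypothetical and do not reflect any specific product's schema.

```python
# Hypothetical field names; real parsers define their own schemas.
SENSITIVE_FIELDS = {"name", "gender", "race", "age", "address"}


def redact(candidate):
    """Return a copy of the parsed record without bias-prone fields."""
    return {k: v for k, v in candidate.items() if k not in SENSITIVE_FIELDS}


# Made-up record, for illustration only.
record = {
    "id": 17,
    "name": "Susan Campbell",
    "address": "1 Example Street",
    "skills": ["budgeting", "forecasting"],
    "education": "Bachelor of Commerce",
}
screening_view = redact(record)
```

Keeping a neutral identifier such as id lets a recruiter link the redacted view back to the full record once the shortlist is decided.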
Challenges
Résumé optimization
Résumé parsers have become so omnipresent that rather than writing to a recruiter, candidates should focus on writing to the parsing system. Understanding how they work is a great first step, but there are also specific changes an applicant can make to optimize their résumé. Here are some tips on how to do that:
- Use keywords from the job description in relevant places on your résumé. These keywords will almost certainly be included in the parsing process.
- Don't use headers or footers. They tend to confuse the parsing algorithms.
- Use a simple style for fonts, layouts and formatting.
- Avoid graphics.
- Use standard section names such as “Work Experience” and “Education”.
- Avoid using acronyms unless they're included in the job description. The safest option may be to write the long form and include the acronym after in parentheses.
- Don't start with dates in the "Work Experience" section. Parsers typically look for dates following job titles or company names.
- Stay consistent when formatting past work experience. The standard order is job title, company name, and then employment dates.
- Most résumé parsers claim to work with all of the main file types, but stick with docx, doc and pdf to be on the safe side.
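The advice above about date placement can be made concrete with a small Python sketch. It assumes a parser that matches work-experience lines against a single fixed pattern of job title, company name, then dates. Real parsers are more flexible, and both the ENTRY_RE pattern and the parse_entry function are invented for this example.

```python
import re

# Hypothetical pattern: "Job title, Company, 2015-2019" with the dates last.
ENTRY_RE = re.compile(
    r"^(?P<title>[^,]+),\s*(?P<company>[^,]+),\s*"
    r"(?P<start>\d{4})\s*-\s*(?P<end>\d{4}|present)$",
    re.IGNORECASE,
)


def parse_entry(line):
    """Return the entry's fields as a dict, or None if the line does not fit."""
    match = ENTRY_RE.match(line.strip())
    return match.groupdict() if match else None


standard = parse_entry("Clinical Scientist, Acme Labs, 2015-2019")
date_first = parse_entry("2015-2019, Clinical Scientist, Acme Labs")
```

Under this assumption the standard line is parsed into its four fields, while the date-first line yields None and the whole entry is lost.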
Software and vendors
Recent advances in AI and machine learning, together with improvements in text mining and analysis that are said to ensure up to 95% accuracy in data processing, have given rise to many AI tools that help job seekers create application documents. These services focus on creating résumés that are friendly to applicant tracking systems (ATS), perform résumé checks and screening, and help with the rest of the preparation and application process. Some of the AI builders, such as Leap.ai and Skillroads, concentrate on résumé creation, while others, like Stella, also help with the job hunt itself by matching candidates to appropriate vacancies. In 2017, Google set out to disrupt the US$215.68 billion global recruitment market by creating Google for Jobs, which is predicted to greatly affect the labor market. This expansion to the search engine uses Cloud Talent Solution, Google's own invention, which is another iteration of the smart AI résumé builder and matching system.
Future
Résumé parsers are already standard in most mid- to large-sized companies, and this trend will continue as the parsers become even more affordable.
A qualified candidate's résumé can be ignored if it is not formatted the proper way or does not contain specific keywords or phrases. As machine learning and natural language processing get better, so will the accuracy of résumé parsers.
One area that résumé parsing software is expanding into is contextual analysis of the information in a résumé, rather than pure extraction. One employee at a parsing company said “a parser needs to classify data, enrich it with knowledge from other sources, normalize data so it can be used for analysis and allow for better searching.”
Parsing companies are also being asked to expand beyond just résumés or even LinkedIn profiles. They are working on extracting information from industry-specific sites such as GitHub and social media profiles.