Job scraping integration with WebSpiderMount

Job wrapping / spider software product website:

Service description & FAQ | Pricing | Demo | Request quote for job wrapping service|

Contents

1. Why our job wrapping services?
2. Job spider features
3. Which job sources to wrap
4. What is the spider tool (job wrapping) designed for?
5. How does the spider extract jobs from employer websites?
6. Can we scrape only specific jobs?
7. Will the jobs spidered look the same as on the source site and as my other job adverts on destination job board?
8. How can I sort jobs out on my website if I have predefined lists of states or categories?
9. Once the posting expires on the employer’s website, is it automatically removed from my job board?
10. What happens when a candidate applies to a spidered job?
11. Are the applications saved / logged in job board database?
12. Will candidates be always redirected to a specific job page on the employer website?
13. Will employers get application notifications?
14. How often can the spider revisit employer websites for job updates?
15. Can any website be spidered?
16. How do I set up the spider for my JobMount job board software?
17. How do I set up the spider for my 3rd party software?

Why our job wrapping services?

1. Confidence in jobs delivery
SpiderMount ensures comprehensive jobs coverage and accuracy.

2. Seamless support
Client’s resources are freed from daily checks by SpiderMount support.

3. Improved quality of job listings
Taxonomy for industry mapping, data filtering, conversion and clean up.

4. Effortless integration
SpiderMount connects to job board & ATS APIs, proprietary systems.

Job spider features

  • Accurate and comprehensive job scraping
  • Quality control, scrapes monitoring & real time reporting
  • Vacancy content clean up and improvement
  • Jobs synchronization via your database API
  • Jobs format conversion, mapping to your database values
  • All jobs or selective scraping, keyword filtering
  • Daily or custom scraping frequency & scheduling
  • Jobs taxonomy: generic and custom category identification
  • Job replication using multiple locations or extra titles
  • Job distribution: bulk posting to other job boards
  • Client Dashboard for scrapes management
  • Immediate restarts of scrapes by clients

Which job sources to wrap

Please review our recommendations for selecting sources effectively:

  1. Search for the websites of direct employers and recruitment agencies, but NOT job boards and aggregators website (e.g. Monster, CareerBuilder, Indeed, Google).
  2. To find employers in a specific industry, try browsing the Government directories (might be present on government websites in some niche) or digests with a list of best employers from the niche etc.
  3. Find the Career or Job section on the employers website. Verify if there is a jobs list page to scrape content from.
  4. Check if jobs from this company are already present on Indeed and SimplyHired: Run the search on Indeed by exact job title and location. Check if this particular job posting is already present. If you have found it, please, try checking another job by this company. If it also posted on Indeed, it is better not to use this employer for scraping.

What is the spider tool (job wrapping) designed for?

Job wrapping service: job spider navigates career website pages and collects jobs to re-post them to your job board.

FAQ Job Wrapping

How does the spider extract jobs from employer websites?

Spider follows specific URLs pre-configured for navigation and saves full HTML pages’ sources.

Parsing module of the spider extracts job data from the HTML converting it to format used by your job board technology, so data becomes available to be posted to your job board job fields mapped.

As per screen shot above:

  • Job ID on the employers website should be put in “Job Ref” field of your job board application.
  • Location on employer website contains State and City. Parsing Module is able to divide these parts and mark each of them as corresponding fields on your job board.
  • Job Description: parsing module can separate full job description together with HTML tags on employer’s website and apply it in full view on your job board software.

Important: Spider technology is sensitive to changes of the links structure and HTML formatting of source sites. Any changes to those most probably will result in the spider being unable to parse data from the source site and might lead to other scraping issues. This usually involves spider settings reconfiguration.

Can we scrape only specific jobs?

Yes, spider includes powerful filtering tools. For example:

  • filter jobs with specific keywords
  • filter jobs by category, job type etc.
  • filter specific jobs by Ref number or URL
  • scrape only latest jobs i.e. newest 50 jobs
  • and others.

Will the jobs spidered look the same as on the source site and as my other job adverts on destination job board?

Source job adverts might have slightly different paragraph / font formatting to destination board.

Job description is normally spidered as HTML (includes font / paragraph formatting). Destination advert formatting (HTML) normally is identical to source formatting unless cleaned up. But CSS / styles used on destination board can be different to ones sourced. I.e. headings on the source and destination might not look identical whilst the general formatting is preserved.

Plain text info / dropdown listings matched from source website (i.e. job title, employment type, location) will look identical to other job adverts published manually on your destination job board.

Job spider helps clean up source HTML via following options:

  • Remove all HTML tags or keep only some of them (i.e. remove all except br, strong, div)
  • Make conversion from HTML to plain text
  • Replace specific HTML content

How can I sort jobs out on my website if I have predefined lists of states or categories?

Spider tool can match spidered keyword data with your lists / IDs.

As per screenshot above:

  • Source website indicates Employment Type as “Full-Time”.
  • Your job board has another naming of this item – “Permanent” which is the same as “Full time”. Job spider can be set up to match this data and sort out jobs on your job board in a proper way.
  • This means that all “Full-Time” jobs will be posted to “Permanent” section because these two terms are matched in spider settings.

Once the posting expires on the employer’s website, is it automatically removed from my Job Board?

Yes, job spider can be set to “Synchronization” mode:

Job spider revisits jobs on employer website and expires them on your job board once the jobs are removed from source website / URL.

What happens when a candidate applies to a spidered job?

Spider can be set up to save job or apply button URL.

As per your choice / job board application process:

  • This URL can be used for candidate redirect to source website.
  • If job URL is not available – generic Employer application URL can be used.
  • Your job board application process can be utilized instead.

Redirect apply method

Are the applications saved / logged in job board database?

It is based on your job board software settings.

Default setting of JobMount job board software: applications are logged so a link to candidate profile is shown in Employer jobs management / “applications history” interface.

Will candidates be always redirected to a specific job page on the employer website?

Candidates can be redirected to a job (or job-specific application page) only if the employer website navigation / URL structure offers this data.

If source jobs are not located on unique URLs: candidates will be redirected to any default page specified (i.e. a job search form on employer website).

Will employers get application notifications?

Based on your job board software setting.

Default setting of JobMount job board software: yes, employers will get application notifications emailed to addresses specified in their profiles.

How often can the spider revisit an employer website for job updates?

Spidering / posting sessions can be scheduled on hourly, daily or weekly basis. On specific week days and certain times of the day.

Can any website be spidered?

95% of career centers are technically spiderable.

Remaining 5% are the websites which are based on technology that blocks access for either jobs browsing or content retrieval. In this case alternative options can be considered (i.e. XML feed, CSV file, FTP access, etc).

Examples of source sites that are hard or impossible to configure initially:

  • Source website Admin detects and blocks our spider: agreement with the site owner is required
  • Vacancies in protected PDF format. Sometimes can be spidered when job PDF files are not protected by PDF security restrictions and are uniform. Normally takes extra configuration effort
  • HTTPS sites with incorrect certificates
  • Flash based websites (cannot be spidered)
  • Sites without uniform structure where all data is placed without formatting i.e. the page is just manually pasted from some text processor
  • Sites with over 100k jobs could be problematic as it takes too long to download data. XML feed alternative is recommended for such sites. Such sources are to be evaluated first.

How do I set up spider for my JobMount job board software?

Spider auto-posts jobs to your JobMount job board via the Job spider xml interface for bulk posting.

Download XML interface description PDF

Career websites set up (scraping configuration, parsing, matching and posting) is performed by JobMount/SpiderMount teams and does not require any effort on your end.

You will need to provide a list of URLs to employers’ websites & data matching instructions only.

How do I set up spider for my 3rd party software?

Job spider can post jobs via existing bulk posting interface available on your job board (i.e. Broadbean, Idibu)

Alternative options for job posting to 3rd party

Custom posting options can be mapped for target job boards having no standard XML via HTTP interface.

Client target job board can have:

  • unique XML or CSV format mapped (based on target job board database settings),
  • posting organized via FTP/sFTP, email send, grab XML from our location.

See full description on Job Spider Auto Posting Options

in ADVANCED CONFIGURATION

    This site is protected by reCAPTCHA, Google Privacy Policy and Terms of Service apply.