Zyte Data job posting schema v1.0

Standard Job Posting Schema (1.0)

Standard Job Posting Schema used in Zyte offerings. It covers the typical set of attributes found in job postings published online.


Responses

Response Schema: application/json
url
required
string (URL)

The URL of the final response, after any redirects.

Required attribute.

If the page contains no job posting data or could not be reached, the returned "empty" item still contains the url field and the metadata field with dateDownloaded.
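
A consumer may want to tell such "empty" items apart from real postings. The following is a minimal sketch, assuming an item is a plain dict decoded from the JSON response and that absent attributes are either missing or null. The helper name is_empty_item is hypothetical and not part of the schema.

```python
def is_empty_item(item: dict) -> bool:
    """Return True when the item carries nothing beyond url and metadata."""
    # Assumption: attributes that were not extracted are missing or None.
    populated = {key for key, value in item.items() if value is not None}
    # An "empty" item has no populated keys outside url and metadata.
    return populated <= {"url", "metadata"}
```

Such a check could run before any further processing so that unreachable pages are logged rather than stored as postings.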

jobPostingId
string

The unique identifier of the job posting.

datePublished
string

Publication date of the job posting.

Format: ISO 8601, "YYYY-MM-DDThh:mm:ssZ", with the timezone if available. If the actual publication date is not found, the "dateModified" value is used.
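
Timestamps in this format can be turned into datetime objects with the Python standard library. The sketch below is one possible approach, not a prescribed one: the helper name parse_timestamp is hypothetical, and it assumes that a value without timezone information should stay a naive datetime.

```python
from datetime import datetime
from typing import Optional


def parse_timestamp(value: Optional[str]) -> Optional[datetime]:
    """Parse a "YYYY-MM-DDThh:mm:ssZ" style string, or return None if absent."""
    if not value:
        return None
    # datetime.fromisoformat only accepts a trailing "Z" from Python 3.11 on,
    # so it is rewritten as an explicit UTC offset for older versions.
    if value.endswith("Z"):
        value = value[:-1] + "+00:00"
    # Strings without a timezone yield a naive datetime (tzinfo is None).
    return datetime.fromisoformat(value)
```

The same helper would apply to the other timestamp attributes that share this format.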

datePublishedRaw
string

Same date as "datePublished", but before parsing/normalization, i.e. as it appears on the website.

dateModified
string

The date when the job posting was most recently modified.

Format: ISO 8601, "YYYY-MM-DDThh:mm:ssZ", with the timezone if available.

dateModifiedRaw
string

Same date as "dateModified" but before parsing/normalization, i.e. as it appears on the website.

validThrough
string

The date after which the job posting is not valid, e.g. the end of an offer.

Format: ISO 8601, "YYYY-MM-DDThh:mm:ssZ", with the timezone if available.

validThroughRaw
string

Same date as "validThrough" but before parsing/normalization, i.e. as it appears on the website.

jobTitle
string

The title of the job.

headline
string

The headline of the job posting.

object

A (typically single) geographic location associated with the job position.

raw
string

The job location string as it appears on the website.

description
string

A description of the job posting including sub-headings, with newline separators.

Format:

  • trimmed (no whitespace at the beginning or the end of the description string),
  • line breaks included,
  • no length limit,
  • no normalization of Unicode characters.

descriptionHtml
string

Simplified HTML of the description, including sub-headings, image captions and embedded content.

employmentType
string

Type of employment (e.g. full-time, part-time, contract, temporary, seasonal, internship).

object

The base salary of the job or of an employee in the proposed role.

raw
string

Salary amount as it appears on the website.

valueMin
string

The minimum value of the base salary as a number string.

valueMax
string

The maximum value of the base salary as a number string. If only one value is given for the salary instead of a range, valueMax holds it.
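
Because valueMin and valueMax are number strings, a consumer has to convert them before doing arithmetic. The sketch below assumes the salary object is available as a plain dict. The helper name salary_range is hypothetical, and treating a lone valueMax as both bounds is a design choice of this example, not something the schema prescribes.

```python
from decimal import Decimal, InvalidOperation
from typing import Optional, Tuple


def salary_range(salary: dict) -> Tuple[Optional[Decimal], Optional[Decimal]]:
    """Return (minimum, maximum) as Decimal values, or None where unavailable."""

    def to_decimal(text: Optional[str]) -> Optional[Decimal]:
        if text is None:
            return None
        try:
            return Decimal(text)
        except InvalidOperation:
            # Assumption: unparsable strings are treated as missing.
            return None

    low = to_decimal(salary.get("valueMin"))
    high = to_decimal(salary.get("valueMax"))
    # A single salary value is carried in valueMax; reuse it as the minimum.
    if low is None:
        low = high
    return low, high
```

Decimal is used instead of float so that amounts such as "55000.50" keep their exact value.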

rateType
string

The type of the salary rate, e.g. hourly, weekly, monthly, yearly.

currencyRaw
string

Currency associated with the salary amount, without normalization.

currency
string

Currency associated with the salary amount, as an ISO 4217 code.

requirements
Array of strings

Candidate requirements for the job.

object

Information about the organization offering the job position.

name
string

Name of the organization.

id
string

The unique identifier of the organization.

jobStartDate
string

Job start date.

Format: ISO 8601, "YYYY-MM-DD".

jobStartDateRaw
string

Same date as "jobStartDate" but before parsing/normalization, i.e. as it appears on the website.

remoteStatus
string
Enum: "Remote" "OnSite" "Hybrid"

Specifies the remote status of the position.

Extracted only if it is specified explicitly in a separate data point, not inferred from the job description.
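
The three allowed values map naturally onto an enumeration. The following is a minimal sketch. The class name RemoteStatus and the helper parse_remote_status are hypothetical, and the choice to raise on unknown values is an assumption of this example.

```python
from enum import Enum
from typing import Optional


class RemoteStatus(str, Enum):
    """The enum values listed for the remoteStatus attribute."""
    REMOTE = "Remote"
    ON_SITE = "OnSite"
    HYBRID = "Hybrid"


def parse_remote_status(value: Optional[str]) -> Optional[RemoteStatus]:
    """Map a raw remoteStatus string to the enum; None means not specified."""
    if value is None:
        return None
    # Matching is case-sensitive; an unknown value raises ValueError.
    return RemoteStatus(value)
```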

metadata
object

Contains metadata about the data extraction process.

dateDownloaded
string

The timestamp at which the job posting data was downloaded.

Format: ISO 8601, "YYYY-MM-DDThh:mm:ssZ".

probability
float

The probability that the page belongs to a certain data type.

Between 0 and 1.0. Always 1.0 for manual extraction.
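
A common use of this value is to discard items that are unlikely to be job postings. The sketch below assumes that probability is nested inside the metadata object of an item decoded into a dict. The helper name passes_threshold and the default cut-off of 0.5 are illustrative choices, not values defined by the schema.

```python
def passes_threshold(item: dict, threshold: float = 0.5) -> bool:
    """Return True when the item's probability is at least the threshold."""
    # Assumption: probability lives under the metadata object.
    metadata = item.get("metadata") or {}
    probability = metadata.get("probability")
    if probability is None:
        # Items without a probability are rejected in this sketch.
        return False
    return probability >= threshold
```

Manually extracted items always report 1.0, so they pass any threshold up to 1.0.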

searchText
string

The search text used to find the job posting.
