Trigger a recrawl of a website document to fetch fresh content
crawl_count is incremented and last_crawled_at is updatedready or error statecrawl configuration, or you must provide one in the request bodycrawl object in the request body to override the stored configuration for this recrawl:
crawl object is provided, the original crawl configuration from document creation is used.
recrawlingcallback_url during document creation, you’ll receive status updatesready (or error if it failed)The unique identifier of the document to recrawl
Optional crawl configuration to override the stored settings. If not provided, the original crawl configuration will be used.
Recrawl initiated successfully
Unique identifier for the document
"d8-5c71baca86fc"
Name of the document
"Company Website"
URL of the document
"https://example.com/"
Current status of the document (will be 'recrawling')
"recrawling"
Progress indicator for document processing
null
ISO 8601 timestamp of when the document was created
"2024-01-01T12:00:00Z"
ISO 8601 timestamp of when the document was last updated
"2024-01-15T10:30:00Z"
URL that will receive status updates
"https://your-server.com/webhook"
Array of document tags
["website", "company"]Additional document properties
{}The crawl configuration being used for the recrawl
List of URLs from the previous crawl (will be updated when recrawl completes)
[
"https://docs.example.com/",
"https://docs.example.com/getting-started"
]ISO 8601 timestamp of the previous crawl
"2024-01-01T12:05:00Z"
Number of times the document has been crawled (will increment when recrawl completes)
1