Get Travel Data Daily with Mass Scale Travel Feed Extraction
The travel industry is ever-changing, with exabytes of data being updated on a daily (sometimes hourly) basis.
For organizations operating in such a dynamic industry, access to accurate, structured data at the right time is of utmost importance. Here is a use case from one of our clients, whose requirement was to extract travel data daily. Read on to learn more.
Client Details
A well-known budget hotel chain that wanted to be updated on hotel prices across India twice a day, in order to strategize better on pricing as well as improve their offerings in general. With ever-growing competition, access to such data becomes a priority to stand out in the industry.
Crawl Requirements: Travel Data Required Daily
The client had a specific set of requirements:
- Check-in and check-out dates were to be specified by the client
- This data was to be uploaded to a file-sharing server, and our crawlers were to pick it up at predefined times and process it. Possible situations (a minimal sketch of this pickup logic follows the list):
- File available in the morning but not in the afternoon
- File not available in the morning but available in the afternoon
- File available both in the morning and afternoon
- No file uploaded in the morning or afternoon
- Fields for extraction were predefined and in a set order as specified by the client
- Crawl frequency was twice a day
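The twice-daily file pickup can be illustrated with a short, simplified sketch. The directory path, file-naming convention, and slot names below are hypothetical assumptions for illustration only; the production setup used the client's file-sharing server and the field order they specified.

```python
from datetime import date
from pathlib import Path
from typing import Optional

# Hypothetical shared location where the client uploads input files;
# in the real setup this was a file-sharing server.
SHARED_DIR = Path("/data/client_uploads")

def find_input_file(slot: str, run_date: date) -> Optional[Path]:
    """Return the client's input file for a slot ('morning' or 'afternoon'),
    or None if nothing was uploaded for that slot."""
    # Assumed naming convention for illustration:
    # checkin_checkout_<YYYY-MM-DD>_<slot>.csv
    candidate = SHARED_DIR / f"checkin_checkout_{run_date.isoformat()}_{slot}.csv"
    return candidate if candidate.exists() else None

def run_slot(slot: str) -> None:
    input_file = find_input_file(slot, date.today())
    if input_file is None:
        # Covers the "no file uploaded for this slot" scenarios:
        # the crawler simply skips the run and waits for the next slot.
        print(f"{slot}: no input file uploaded, skipping crawl")
        return
    print(f"{slot}: picked up {input_file.name}, starting crawl")
    # crawl(input_file) would parse the check-in/check-out dates and
    # extract the predefined fields in the agreed order.

if __name__ == "__main__":
    # In production each slot is triggered by a scheduler (e.g. cron)
    # at the pre-decided morning and afternoon times.
    for slot in ("morning", "afternoon"):
        run_slot(slot)
```

This covers all four availability combinations: whichever slot has a file gets crawled, and a slot with no upload is skipped without manual intervention.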
Target Sites and Approximate Monthly Volumes
Most of the major OTAs (Online Travel Agents) in India with a volume of around 30 million records per month. Approximately 300,000 to 400,000 records per site per day were delivered based on the files uploaded by the client.
Approaches Used to Tackle the Issues
- We programmed the crawlers to search for the files on the sharing server at a pre-decided time and pick them up, if available. The crawler would check for files once in the morning and once in the afternoon.
- Additional scripts were written and additional resources were deployed to ensure data delivery happened before 23:59 hrs each day.
- The crawlers were programmed to detect whether prices were quoted in INR and, if not, to convert them to INR. This was important because servers from across the world were being used to crawl the data.
- The crawlers were also throttled so as not to hit the target servers too aggressively and risk being blocked, while at the same time ensuring that all the necessary data was captured before 23:59 hrs. (Both points are sketched below.)
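A minimal sketch of the currency normalization and polite throttling is shown below. The conversion rates and delay bounds are illustrative assumptions; the production crawlers detected the quoted currency on each page and tuned request pacing per target site.

```python
import random
import time

# Illustrative conversion rates for this sketch only; the production
# crawlers used current rates (or re-requested pages with INR selected).
TO_INR = {"INR": 1.0, "USD": 83.0, "EUR": 90.0}

def normalize_to_inr(price: float, currency: str) -> float:
    """Return the price in INR, converting if the page quoted another
    currency -- which can happen when crawl servers in other regions are
    geo-served non-INR prices by the target OTA."""
    if currency == "INR":
        return price
    return round(price * TO_INR[currency], 2)

def polite_delay(min_s: float = 2.0, max_s: float = 5.0) -> None:
    """Sleep a randomized interval between requests so the target servers
    are not hit too aggressively; the bounds are tuned so the full crawl
    still finishes before the 23:59 delivery cut-off."""
    time.sleep(random.uniform(min_s, max_s))

# Example usage for one extracted record (values are illustrative):
price_inr = normalize_to_inr(120.0, "USD")   # -> 9960.0
polite_delay()                               # throttle before the next page fetch
```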
Benefits delivered
- Regular and flexible access to the required data. Since the crawler picked up files only if available, the client had the flexibility to upload them only when needed.
- Compared to maintaining a dedicated in-house team directly involved with this activity, the client achieved cost savings of about 23%.
- With the inventory crawled fortnightly, they had regular access to updated data and were in a better position to increase their footprint across the country.
- With a low turnaround time, the data extracted could be used more effectively.
- Except for the initial onboarding period, the process was completely automated. Any disruption in service was also automatically reported to the support team to ensure that the crawls ran smoothly.