Opportunities in Travel Data Science
The Promise of Big Data
The travel industry has always had a stake in big data – data sets too large for conventional data applications to handle. The difference today is that they have the tools and storage capacity to take advantage of it.
- Every step in a journey – be it the voyage of a traveler or a freight shipment – can now be tracked and recorded.
- Any of these data points, once analyzed, may contribute to greater business intelligence.
In a world where the global travel industry is expected to grow to 10 percent of the global GDP by 2022, greater business intelligence means big bucks.
It’s not just the improved access to internal sources (e.g., booking records, smartphone use, customer profiles, itineraries, complaint and feedback forms, sensor data) that has data scientists perking up. It’s the potential benefits of combining those sources with external data sets (e.g., search, social media, reviews, weather and traffic reports) to solve problems, slash costs and anticipate future events.
Sponsored Master's in Data Science Programs
More InfoSyracuse University
* GRE waivers are available.
More InfoSouthern Methodist University
* GRE waivers available for experienced applicants
More InfoAmerican University
More InfoSyracuse University
More InfoUniversity of California-Berkeley
More InfoSyracuse University
More InfoGeorge Washington University
Truly Personalized Offers
Once upon a time, travel companies used profiling to segment customers into broad categories (e.g., high-income businessperson, family with two children, etc.).
Not anymore. Today, they’re employing big data to create a 360-degree view of each customer. In lieu of generic recommendations, companies are now within reach of the possibility of providing ultra-personalized offers.
This view in the round can be assembled from a number of datasets:
- Behavioral targeting – e.g., website visitor behavior
- Social media – e.g., posts on travel, reviews from friends
- Location tracking records
- Previous purchase patterns
- Mobile device usage data
- Image processing
- Predictive analytics
- And much, much more
Some of these datasets (e.g., behavioral targeting) will be more relevant for new customers; others (e.g., previous purchase patterns) will come into play when companies are looking for ways to cross-sell and up-sell.
- Do you have a prospect in Ohio who is frequently on Facebook visiting pages about Venice? Create a Facebook feed offer with a discount for flights to Italy.
- Do you have information showing that comfortable seating is more important to your passenger than price? Give your frequent flyer a head start on booking an aisle seat to Florida.
The goal is to create the right offer through the right channel at the right time and place.
Enhanced Customer Service
The era of personalization is creating another benefit. It’s giving the transportation industry a chance to improve its customer service.
Take the British Airways “Know Me” program. The airline collates a wide range of passenger data (e.g., travel itinerary, social media, frequent flyer status) as well as personal notes in order to improve front desk and on-board services. Has your bag been lost? You may receive a free upgrade on your next flight.
Or look at Qantas. It’s using iPads to collect passenger data in real-time. Racked up some frequent-flyer points while dining in the terminal prior to your flight? Flight attendants won’t be bothering you with their trays of mystery meat.
Or consider the basic case of KLM. It combed Twitter and Foursquare accounts to determine which customers were taking a KLM flight. Their social media behavior yielded a personality profile that guided the advance purchase of an appropriate gift. Passengers were greeted with the gift on arrival at the airport.
Identifying MVCs (Most Valuable Customers)
Naturally, some customers are going to travel more than others. That means companies need to be acutely aware of the big players so they can avoid customer churn. As Anna Papachristos points out:
“Without a customer valuation program that includes robust value integration and ranking capability, a company can be sidetracked and/or financially challenged in managing the different customer groups and delivering value to each.”
The travel industry already has a giant legacy of data on MVCs from loyalty programs. The trick is combining this historical information with real-time and predictive analytics to anticipate what MVCs will likely want in the future. It costs far more to acquire a new customer than it does to retain an old one.
Up-Selling and Cross-Selling
Let’s say you’re traveling to Buenos Aires for a four-day business conference and you’ve decided to take a weekend to explore the city. You’re looking for a flight that leaves Sunday and returns early Monday.
If your airline has invested in big data, it’s going to be offering you cross-selling and up-selling opportunities from the moment you begin your search.
- Receive a personalized offer in your email box
- Be pre-booked for Economy Plus seating
- Be tempted with discounts on hotels that partner with your airline
- Be offered a complimentary dining coupon from the steward
- Find an ad for weekend city tours appearing on your in-flight entertainment
When companies have the resources to partner with others (e.g., airlines with airports, train and taxi services), cross-selling opportunities can skyrocket.
At Schiphol Airport in The Netherlands, data scientists have developed heat maps to visualize the connection between retail sales and departure gates. This gives them the knowledge to tweak floor layouts to increase foot traffic to these stores.
In the world of travel, complete customer behavior profiles + increased collaboration = more revenue generation.
When it comes to safety, big data can be a literal lifesaver. Today’s planes, trains and automobiles are equipped with a wide array of sensors. These provide control centers with a continuous stream of real-time data on every aspect of the journey (e.g., airmanship or driver behavior, environment, mechanical performance, etc.).
With this information in hand, transportation data scientists are creating complex algorithms to predict problems and, even better, prevent them.
- Critical part wearing down? Replace it before becomes an issue.
- One of your drivers consistently forgetting an important step? Take him/her off the route for re-training.
- Issue that can’t be fixed mid-air? Have a maintenance crew ready with the right tools on arrival.
Not only do these steps slash costs and increase reliability – imperative in a 24/7 industry like travel and transportation – they help avoid catastrophic accidents.
The more efficient your business, the bigger your end-of-year profits. In addition to sensors, data scientists are extracting information from all corners of the digital universe to streamline day-to-day operations.
For example, companies are:
- Integrating data on daily advanced booking trends, in-house historical data and customer behavioral data to create the ultimate “no seat left unfilled” yield management system.
- Pulling data on weather, wind and traffic reports to predict delays and fuel needs, reroute passengers or alter routes.
- Using predictive algorithms to direct drivers with a smartphone to the closest available parking locations. See ParkMe for a demonstration.
Automation has been a massive help in this effort:
- Automatic vehicle locations (AVL) mechanisms provide information to passengers on digital signs and personal apps.
- Automated passenger counting (APC) and fare tools inform transit companies when to increase or decrease their fleet sizes.
In the future, even your trip bookings may be automated. With only a fuzzy idea in its search engine as a start, the travel start-up Hopper is aiming to employ big data algorithms to plan and book your personalized trip. No effort required.
Data Risks and Regulations
The Challenges Ahead
Like many other industries, travel and transportation is grappling with big-data challenges. A mind-boggling volume, variety and velocity of data accumulates daily. The veracity of data is in question. Technology changes by the hour. Trained data scientists are thin on the ground.
And we haven’t even started on:
- Data fragmentation – Remember that 360-degree view? Datasets on customer experience can be scattered among hundreds of places, including baggage, loyalty programs, complaint databases, twitter feeds, flight operations and booking forms.
- Missing, hidden or locked data – Many datasets (e.g., city transit and traffic) are proprietary and private. Others are tricky to access or find. Sometimes what could be the key dataset simply doesn’t exist.
- Implementation costs – As the case study of Crandall and American illustrates, making big data a priority takes time, money and expertise. It’s going to require investment, company-wide.
That’s not all. The airline industry has another problem…
The Burden of Legacy
Ironically, airlines have become victims of their early success. Early investment in big-data technology has left them with clunky and aging operational systems. Some are wrestling with a legacy of reservation software that’s fifty years old.
Start-ups and online travel firms have the luxury of creating centralized data warehouses from scratch. Older companies like airlines must find ways to come up with a hybrid environment where big-data technologies can co-exist with existing hardware and legacy software and databases. It’s not going to be easy.
A Little Privacy, Please
Finally, here’s a cautionary tale from the big data front. In 2012, news sources reported that Orbitz was offering costlier featured deals to Mac users. The reason? Their big-data algorithms had determined that Mac users spend up to 30 percent more on hotels than Window users.
Though all visitors were able to see the same hotel pricing options in search, some Apple customers weren’t exactly thrilled with Orbitz’s predictive analytics coup.
And this wasn’t even close to 360-degree profiling. As companies continue their drive towards data acquisition, questions on individual privacy will continue to arise:
- Who owns travel data? How often can it be shared with third parties?
- Which privacy rules apply for global conglomerates? Are they obeying the laws of local jurisdictions? The E.U., in particular, has strict privacy legislation.
- When does “true personalization” become a creepy invasion of privacy?
Travel companies, which often store sensitive credit information, must also be vigilant about digital safety and security measures. The more data they collect on individual passengers, the more attractive they become to cyber-criminals.
History of Data Analysis, Travel and Transportation
“You put together the best team that you can with the players you’ve got, and replace those who aren’t good enough.” – Robert Crandall
First there were horse-drawn carriages. Then there were trains. Then there were planes. At each step along the evolution of transportation, there was a surge in data collection – reservations, arrivals and departures, cargo capacities – and in savvy entrepreneurs, to make the most of it all.
- In 1841, Thomas Cook, the first modern travel agent, offered members of his temperance society a 1-shilling package deal for train ticket and lunch.
- In 1907, Jim Casey borrowed $100 from a friend to create the American Messenger Company, the forerunner of UPS.
- In 1934, E.L. Cord acquired American Airways, renamed it American Airlines and hired C.R. Smith to run it.
And it is with C.R. Smith and American Airlines that we start our story of computerized data analysis.
Is That Seat Reserved?
In the 1940s, American Airlines had a big problem. Despite giant leaps forward in technology during World War II, airline employees were still encumbered with a manual booking process. Finding a seat for a customer, updating the inventory and recording passenger data could take airline agents up to 90 minutes. Per booking.
Fortunately, American had hired engineer and data expert Charles Amman to work on a solution. In 1944, he mocked up an automated system for storing seat inventory and showed his idea to Smith. Approved, the idea was then implemented by the Teleregister Company of Stamford, Connecticut.
Together, they built the Reservisor, an electromechanical flight board that could broadcast seat availability. It first went live in February 1946.
A few years later, in 1952, Amman and his colleagues updated the system with drum memory technology. The new Magnetronic Reservisor allowed for direct manipulation of available seats and could store data for up to 1,000 flights, 10 days into the future. It was subsequently joined by Reserwriter, which recorded passenger information.
Mr. and Mr. Smith: Speed in Booking
There was one weak point left in the Reservisor system. It still depended on manual input from multiple employees. Booking a flight – in an era where passenger volume was exploding – could take as long as three hours.
Trans-Canada Airlines (TCA) had developed a system, ReserVec, in 1953 that was working much more efficiently, but it couldn’t break into the U.S. market.
Time for a little serendipity. That same year, on a flight from Los Angeles to New York, C.R. Smith met Blair Smith, an IBM sales executive (they bonded over names) and asked him if IBM’s computer experts couldn’t do something to help.
The result? IBM and American developed the Semi-Automatic Business Research Environment (SABRE). Sabre was built on teleprinter technology initially developed for the U.S. Air Force. With IBM’s electronic brain in place, American’s reservation process became a thing of automated beauty.
Fully operational in 1964, Sabre was the largest private, real-time data processing system in the world. It was quickly joined by Deltamatic, PANAMAC, Apollo and PARS.
Welcome, Frequent Flyers!
Not long afterward, airlines started experimenting with the idea of loyalty programs. The Airline Deregulation Act of 1978 was creating cutthroat competition. Companies needed to stand out.
- In 1972, United Airlines began handing out plaques and promotional materials.
- In 1979, Texas International Airlines pioneered the use of mileage tracking to grant rewards.
- And in 1981, American launched the AAdvantage program.
AAdvantage was a case of big data in action. Data analysts combed Sabre’s computer reservation system for recurring phone numbers to detect frequent flyers. These folks were pre-enrolled in the program and offered a variety of benefits – complimentary upgrades, free or discounted tickets, etc.
Loyalty programs later expanded to include hotels, rental car agencies and other carriers. Airlines were now swimming in data about their customers’ travel habits.
Meanwhile, as the seventies slid into the eighties…
- The cost of desktop computers was dropping rapidly.
- Reservation terminals were appearing in travel agencies.
In 1987, European airlines including Air France and Lufthansa developed Amadeus, a global distribution system that could compete with Sabre.
Yield Management Systems For Greater Efficiency
Deregulated fare and schedule structures, a shift to a hub and spoke strategy (with more connecting flights) and increased competition from new low cost carriers like People Express Airlines had another effect in the 80s.
It prompted Robert Crandall of American to set up the first automated yield management system, and many more to follow him.
A yield management system helps airlines provide variable pricing for a fixed resource (e.g., seats) in order to maximize their revenue. In the best-case scenario, this analytics-based inventory control system is capable of:
- Determining how many seats to allocate to each fare category
- Adjusting seat allocations as a travel date nears
- Taking into account bookings for connecting flights
- Offering discounts when there is a surplus of empty seats
- Keeping seats available for full-fare customers (e.g., businesspeople) who book at the last minute
It can also estimate demand and cancellation rates.
Though it took some time to develop, American’s Dynamic Inventory and Maintenance Optimizer (DINAMO) was fully implemented in 1988. It allowed American to undercut the competition and increase productivity per analyst by 30 percent in two years.
The Democratization of Travel
The rumblings had been happening for some time. As far back as 1985, Sabre had launched easySabre®, which provided online access to bookings for airlines and hotels.
But it was in the 1990s that worldwide access to data truly exploded. In 1991, UPS deployed its handheld Delivery Information Acquisition Device (DIAD), delivering information back to headquarters and marking a milestone in strategic use of data. In 1995, customers gained the ability to track their packages in transit via UPS’s website.
In 1996 alone:
- Sabre debuted its website Travelocity, which cut out the travel agent middleman and put reservations directly in the hands of customers.
- Microsoft launched Expedia, with online bookings for airlines, hotels and car rentals.
Airlines were well aware of the changes. In 1999, they partnered to develop industry standards for the electronic exchange of business information. Two years later, in 2001, five of the six major carriers launched Orbitz.
As search technologies grew in sophistication, so too did websites. In 2004, cofounders of Orbitz, Expedia and Travelocity unveiled Kayak.com. Kayak’s “metasearch” model gave users a central place to search for and compare airline prices across multiple sites.
Meanwhile, the cost of data storage and processing was dropping rapidly. Mobile technologies were coming online. Social media blossomed. The travel and transportation industry started to take a serious look at the promise of big data.