How Airbnb makes use of visible attributes to reinforce the Visitor and Host expertise
By Yuanpei Cao, Bill Ulammandakh, Hao Wang, and Tony Hwang
On Airbnb, our hosts share distinctive listings everywhere in the world. There are tons of of thousands and thousands of accompanying itemizing pictures on Airbnb. Itemizing pictures comprise essential details about type and design aesthetics which might be tough to convey in phrases or a set record of facilities. Accordingly, a number of groups at Airbnb are actually leveraging pc imaginative and prescient to extract and incorporate intangibles from our wealthy visible knowledge to assist friends simply discover listings that swimsuit their preferences.
In earlier weblog posts titled WIDeText: A Multimodal Deep Studying Framework, Categorizing Itemizing Photographs at Airbnb and Amenity Detection and Past — New Frontiers of Laptop Imaginative and prescient at Airbnb, we explored how we make the most of pc imaginative and prescient for room categorization and amenity detection to map itemizing pictures to a taxonomy of discrete ideas. This submit goes past discrete classes into how Airbnb leverages picture aesthetics and embeddings to optimize throughout varied product surfaces together with advert content material, itemizing presentation, and itemizing suggestions.
Engaging pictures are as very important as value, critiques, and outline throughout a visitor’s Airbnb search journey. To quantify “attractiveness” of pictures, we developed a deep learning-based picture aesthetics evaluation pipeline. The underlying mannequin is a deep convolutional neural community (CNN) educated on human-labeled picture aesthetic ranking distributions. Every picture was rated on a scale from 1 to five by tons of of photographers based mostly on their private aesthetic measurements (the upper the ranking, the higher the aesthetic). Not like conventional classification duties that classify the picture into low, medium and high-quality classes, the mannequin was constructed upon the Earth Mover’s Distance (EMD) because the loss perform to foretell photographers’ ranking distributions.
The expected imply ranking is extremely correlated with picture decision and itemizing reserving chance, in addition to high-end Airbnb itemizing picture distribution. Ranking thresholds are set based mostly on use instances, corresponding to advert picture suggestion on social media and picture order suggestion within the itemizing onboarding course of.
Airbnb makes use of promoting on social media to draw new prospects and encourage our group. The social media platform chooses which adverts to run based mostly on thousands and thousands of Airbnb-provided itemizing pictures.
Since a visually interesting Airbnb picture can successfully entice customers to the platform and significantly enhance the advert’s click-through price (CTR), we utilized the picture aesthetic rating and room categorization to pick out essentially the most engaging Airbnb pictures of the lounge, bed room, kitchen, and exterior view. The criterion for “good high quality” itemizing pictures was set based mostly on the highest fiftieth percentile of the aesthetic rating and tuned based mostly on an inner handbook aesthetic analysis of 1K randomly chosen itemizing cowl pictures. We carried out A/B testing for this use case and located that the advert candidates with a better aesthetic rating generated a considerably increased CTR and reserving price.
When posting a brand new itemizing on Airbnb, hosts add quite a few pictures. Optimally arranging these pictures to spotlight a house might be time-consuming and difficult. A bunch may be unsure concerning the very best association for his or her photos as a result of the work requires making trade-offs between picture attractiveness, picture range, and content material relevance to friends. Extra particularly, the primary 5 pictures are an important for itemizing success as they’re essentially the most incessantly considered and essential to forming the preliminary visitor impression. Accordingly, we developed an automatic picture rating algorithm that selects and orders the primary 5 pictures of a house leveraging two visible alerts: house design analysis and room categorization.
Residence design analysis estimates how nicely a house is designed from an inside design and structure perspective. The CNN-based house design analysis mannequin is educated on Airbnb Plus and Luxe qualification knowledge that assess the aesthetic enchantment of every picture’s house design. Airbnb Plus and Luxe listings have handed strict house design analysis standards and so the info from their qualification course of is well-suited for use as coaching labels for a house design analysis mannequin. The pictures are then labeled into totally different room sorts, corresponding to lounge, bed room, toilet and so on, by the room categorization mannequin. Lastly, an algorithm makes trade-offs between picture house design attractiveness, picture relevance, and picture range to maximise the reserving chance of a house. Under is an instance of how a brand new picture order is recommended. The picture auto-rank characteristic was launched in Host’s itemizing onboarding product in 2021, resulting in important lifts in new itemizing creation and reserving success.
Authentic ordering
Auto-suggested ordering
Past aesthetics, pictures additionally seize the overall look and content material. To effectively symbolize this info, we encode and compress pictures into picture embeddings utilizing pc imaginative and prescient fashions. Picture embeddings are compact vector representations of photos that symbolize visible options. These embeddings might be in contrast towards one another with a distance metric that represents similarity in that characteristic house.
The options discovered by the encoder are instantly influenced by the coaching picture knowledge distribution and coaching goals. Our labeled room sort and amenity classification knowledge permits us to coach fashions on this knowledge distribution to provide semantically significant embeddings for itemizing picture similarity use instances. Nevertheless, as the amount and variety of photos on Airbnb develop, it turns into more and more untenable to rely solely on manually labeled knowledge and supervised coaching strategies. Consequently, we’re presently exploring self-supervised contrastive coaching to enhance our picture embedding fashions. This type of coaching doesn’t require picture labels; as a substitute, it bootstraps contrastive studying with synthetically generated optimistic and unfavourable pairs. Our picture embedding fashions can then study key visible options from itemizing pictures with out handbook supervision.
It’s typically impractical to compute exhaustive pairwise embedding similarity, even inside centered subsets of thousands and thousands of things. To assist real-time search use instances, corresponding to (close to) duplicate picture detection and visible similarity search, we as a substitute carry out an approximate nearest neighbor (ANN) search. This performance is basically enabled by an environment friendly embedding index preprocessing and building algorithm known as Hierarchical Navigable Small World (HNSW). HNSW builds a hierarchical proximity graph construction that enormously constrains the search house at question time. We scale this horizontally with AWS OpenSearch, the place every node accommodates its personal HNSW embedding graphs and Lucene-backed indices which might be hydrated periodically and might be queried in parallel. So as to add real-time embedding ANN search, now we have applied the next index hydration and index search design patterns enabled by present Airbnb inner platforms.
To hydrate an embedding index on a periodic foundation, all related embeddings computed by Bighead, Airbnb’s end-to-end machine studying platform, are aggregated and endured right into a Hive desk. The encoder fashions producing the embeddings are deployed for each on-line inference and offline batch processing. Then, the incremental embedding replace is synced to the embedding index on AWS OpenSearch by Airflow, our knowledge pipeline orchestration service.
To carry out picture search, a consumer service will first confirm whether or not the picture’s embedding exists within the OpenSearch index cache to keep away from recomputing embeddings unnecessarily. If the embedding is already there, the OpenSearch cluster can return approximate nearest neighbor outcomes to the consumer with out additional processing. If there’s a cache miss, Bighead known as to compute the picture embedding, adopted by a request to question the OpenSearch cluster for approximate nearest neighbors.
Following this embedding search framework, we’re scaling real-time visible search in present manufacturing flows and upcoming releases.
Airbnb Categories assist our friends uncover distinctive getaways. Some examples are “Superb views”, “Historic properties”, and “Inventive areas”. These classes don’t at all times share widespread facilities or discrete attributes, as they typically symbolize an inspirational idea. We’re exploring computerized class growth by figuring out related listings based mostly on their pictures, which do seize design aesthetics.
Within the 2022 Summer season Launch, Airbnb launched rebooking help to supply friends a clean expertise from Neighborhood Help ambassadors when a Host cancels on quick discover. For the aim of recommending comparable listings all through the rebooking course of, a two-tower reservation and itemizing embedding mannequin ranks candidate listings, up to date each day. As future work, we will contemplate augmenting the itemizing illustration with picture embeddings and enabling real-time search.
Photographs comprise aesthetic and style-related alerts which might be tough to precise in phrases or map to discrete attributes. Airbnb is more and more leveraging these visible attributes to assist our hosts spotlight the distinctive character of their listings and to help our friends in discovering listings that match their preferences.
Eager about working at Airbnb? Take a look at our open roles.
Due to Teng Wang, Regina Wu, Nan Li, Do-kyum Kim, Tiantian Zhang, Xiaohan Zeng, Mia Zhao, Wayne Zhang, Elaine Liu, Floria Wan, David Staub, Tong Jiang, Cheng Wan, Guillaume Man, Wei Luo, Hanchen Su, Fan Wu, Pei Xiong, Aaron Yin, Jie Tang, Lifan Yang, Lu Zhang, Mihajlo Grbovic, Alejandro Virrueta, Brennan Polley, Jing Xia, Fanchen Kong, William Zhao, Caroline Leung, Meng Yu, Shijing Yao, Reid Andersen, Xianjun Zhang, Yuqi Zheng, Dapeng Li, and Juchuan Ma for the product collaborations. Additionally thanks Jenny Chen, Surashree Kulkarni, and Lauren Mackevich for modifying.
Due to Ari Balogh, Tina Su, Andy Yasutake, Pleasure Zhang, Kelvin Xiong, Raj Rajagopal, and Zhong Ren’s management assist on constructing pc imaginative and prescient merchandise at Airbnb.