
Search indexing is often the first topic we discuss with customers when starting a new business engagement. Whether it’s a large enterprise-scale site or a small ecommerce store, the first step to adding search to a website is indexing your content through a website crawler or API. Your site’s architecture, schema, and content can all affect indexing.

In this article, we’ll cover a lot of the topics we discuss with our customers and share some actionable tips for improving a search index.

Note: this is not an article about SEO. While on-site search optimization and SEO are related — the work you do to optimize your search index for on-site search also helps with Google search or Bing visibility — they address different needs. SEO is geared towards internet visibility, whereas on-site search addresses user experience. However, the XML sitemaps, internal links, meta tags, etc., you create for one will help the other!

What is a search index?

A search index helps users quickly find information on a website. It is designed to map search queries to web pages, documents, or other site content. It’s analogous to the index in a book: it lets the user quickly find useful information using keywords, but it has technological advantages over a book index, such as helping visitors find what they want faster. Search indexes can be created either through web crawlers or via API access, and each approach has benefits for different situations.

What is full-text search?

Full-text search entails indexing each word on your site in order to make search engine navigation through many records easy. Traditionally, full-text search engines used an “inverted index” — essentially, a map of all the keywords in your document and the locations of those keywords.

[Image: full-text search example]

In the example above, the keywords “portable” and “sound” aren’t in the index, but an AI-powered search engine understands context to deliver great results.

AI-powered search engines can now go beyond keywords and use context to provide richer results. Take the query “portable sound” as an example. If a keyword-based search engine has the terms “portable” and “speaker” in the index, the results page may include the correct item. With machine learning, you can get good results even if the keywords aren’t on the site, because the engine detects context and similarities between words. A machine can learn, for example, that the word “portable” is similar to “handheld”, “mobile”, and “phone”, all of which are near in meaning but not necessarily synonymous.
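To make the traditional “inverted index” described earlier more concrete, here is a minimal Python sketch. It illustrates the general technique only and is not how any particular search engine implements it; the sample records and the function names (tokenize, build_inverted_index, search) are assumptions made up for this example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(documents):
    """Map each keyword to the documents and word positions where it occurs."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in documents.items():
        for position, word in enumerate(tokenize(text)):
            index[word][doc_id].append(position)
    return index


def search(index, query):
    """Return the IDs of documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    matches = [set(index[word]) if word in index else set() for word in words]
    return set.intersection(*matches)


# Hypothetical sample records
documents = {
    "p1": "Bluetooth speaker with wireless charging",
    "p2": "Portable speaker for travel",
    "p3": "Wireless headphones",
}

index = build_inverted_index(documents)
print(search(index, "wireless speaker"))  # {'p1'}
print(search(index, "portable sound"))    # set()
```

The second query returns nothing because “sound” never appears in the records. That is the gap a purely keyword-based index leaves, and the one that context-aware search tries to close.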

Search crawlers and APIs

There are two primary ways to build a search engine index: running a search crawler or pulling data directly from a database via APIs. Each of these has benefits for different situations.

For example, for most static websites, a crawler is fine. It’s fast and comprehensive. API-driven indexing is ideal for sites with dynamic or constantly changing data. APIs have their own set of advantages such as the ability to quickly add new data sources.

What is fast indexing?

When you add new content or change existing content, you want results to be searchable in real time. Fast indexing is a must-have for retailers and brands selling new products or launching campaigns. On occasion, when our customers have problems with fast indexing, it’s typically due to an issue such as:

  • Content isn’t getting indexed fast enough due to complex architecture or an API issue
  • Content is in the index, but not getting displayed in results
  • PDF and DOC files fail to index

Most problems can be resolved relatively quickly. The first thing to do is check how the crawler views your website documents, or whether your data pipeline is blocked. Using a sitemap.xml file to assist the crawler is always a good practice and can help get your content indexed quickly. If you’re indexing your site via API, it’s likely that there is an integration issue that needs to be resolved.
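If you don’t have a sitemap.xml yet, generating one from a list of pages is straightforward. The sketch below uses Python’s standard library and the public sitemaps.org format; the URLs and dates are placeholders, and build_sitemap is a hypothetical helper name rather than part of any product.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(pages):
    """Build a sitemap.xml string from dicts with 'loc' and 'lastmod' keys."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for page in pages:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = page["loc"]
        ET.SubElement(url, "lastmod").text = page["lastmod"]
    return ET.tostring(urlset, encoding="unicode")


# Placeholder pages for illustration
pages = [
    {"loc": "https://example.com/", "lastmod": "2023-12-05"},
    {"loc": "https://example.com/products/speaker", "lastmod": "2023-11-20"},
]

sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```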

To help with all this, and to simplify the indexing process, we offer API clients in many programming languages, dashboards that help you visualize the index and crawling processes, and CLI tools for interacting with the API in a variety of convenient ways.

12 ways to optimize and enrich your search index

Whether you are using a search crawler or connecting your site via API, there are many ways to configure and improve a search index. The real-world suggestions below come directly from the conversations we often have with customers who are building their index via crawler or API. Some of these methods are more appropriate for crawler-based indexes, others are relevant to API indexing, and a few are relevant for both.

Here are 12 ways you can optimize your search index:

1. Open Graph metadata

Facebook released their Open Graph protocol in 2010 and since then it has become widely used by search engines. Search results often include an image preview, and most often this is powered by Open Graph.

By adding Open Graph tags to your content, you can improve a search index with information such as:

  • Title and type of content
  • Image and URL
  • Additional Open Graph data

Open Graph offers plenty of other properties for enriching a search index besides title, description, and images, but many people don’t know about them or use them all. For more information, visit https://ogp.me/.
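As a rough illustration of how a crawler might pick up Open Graph tags, the following Python sketch collects every og: property from a page into a record. The HTML is a made-up sample and OpenGraphParser is a hypothetical class name; a production crawler would handle many more edge cases.

```python
from html.parser import HTMLParser


class OpenGraphParser(HTMLParser):
    """Collect <meta property="og:..."> tags from an HTML page."""

    def __init__(self):
        super().__init__()
        self.properties = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("property") or ""
        content = attributes.get("content")
        if name.startswith("og:") and content is not None:
            self.properties[name] = content


# Made-up sample page
html = """
<html><head>
<meta property="og:title" content="How to optimize a search index" />
<meta property="og:type" content="article" />
<meta property="og:image" content="https://example.com/images/index.jpg" />
<meta property="og:url" content="https://example.com/blog/search-index" />
<meta name="viewport" content="width=device-width" />
</head><body></body></html>
"""

parser = OpenGraphParser()
parser.feed(html)

# Strip the "og:" prefix to get plain field names for the index record
record = {key[len("og:"):]: value for key, value in parser.properties.items()}
print(record)
```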

2. Schema.org formats

Open Graph is just one of several open protocols for enriching web and search engine indexing data. There are different kinds of schemas you can mark up your page content with. For example, if you’re a recipe site, you will have different standards for how you mark up content than, say, an event website.

Schema.org publishes and maintains schema vocabularies for different kinds of sites. For example, for events such as a concert, lecture, or festival, ticketing information may be added to the markup (in HTML or JSON-LD format) via the offers property. Repeated events may be structured as separate Event objects.
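Here is a small sketch of what Event markup with an offers property could look like when generated as JSON-LD from Python. The event details, venue, and URLs are invented for illustration; only the schema.org types and property names (Event, Place, Offer, offers, price, priceCurrency, availability) come from the vocabulary itself.

```python
import json

# Invented example event; only the schema.org vocabulary terms are real
event = {
    "@context": "https://schema.org",
    "@type": "Event",
    "name": "Example Jazz Night",
    "startDate": "2024-06-01T19:30:00+02:00",
    "location": {
        "@type": "Place",
        "name": "Example Hall",
        "address": "1 Example Street, Paris",
    },
    "offers": {
        "@type": "Offer",
        "url": "https://example.com/tickets/jazz-night",
        "price": "30.00",
        "priceCurrency": "EUR",
        "availability": "https://schema.org/InStock",
    },
}

json_ld = json.dumps(event, indent=2)

# JSON-LD is embedded in the page inside a script tag of this type
script_tag = f'<script type="application/ld+json">\n{json_ld}\n</script>'
print(script_tag)
```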

3. Article publish and modified times

The article publish and modified dates and times are essential for sorting content by recency. These timestamps are supported in both the Open Graph and schema.org formats.

article:published_time – datetime – When the article was first published.
article:modified_time – datetime – When the article was last changed.
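To show why these timestamps matter, this sketch converts ISO 8601 datetimes into numeric timestamps and sorts records by most recently modified. The article records and the modified_timestamp field name are assumptions for the example, not a required schema.

```python
from datetime import datetime


def to_timestamp(iso_datetime):
    """Convert an ISO 8601 datetime string with a UTC offset to a Unix timestamp."""
    return int(datetime.fromisoformat(iso_datetime).timestamp())


# Hypothetical article records
articles = [
    {
        "title": "Search index basics",
        "published_time": "2023-01-15T09:00:00+00:00",
        "modified_time": "2023-11-20T10:30:00+00:00",
    },
    {
        "title": "Holiday campaign checklist",
        "published_time": "2023-10-02T08:00:00+00:00",
        "modified_time": "2023-10-02T08:00:00+00:00",
    },
    {
        "title": "Facets and filters guide",
        "published_time": "2023-06-10T14:00:00+00:00",
        "modified_time": "2023-12-05T16:45:00+00:00",
    },
]

# Store a numeric field so the index can sort on it directly
for article in articles:
    article["modified_timestamp"] = to_timestamp(article["modified_time"])

by_recency = sorted(articles, key=lambda a: a["modified_timestamp"], reverse=True)
print([article["title"] for article in by_recency])
```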

4. Identify header and footer content

Miscellaneous content such as your nav, footer, and anything else not specific to the page should sit inside the header and footer tags so search engines know to ignore it. By marking up the header and footer content, you give the search engine a better chance of understanding what the page is about, and of separating navigational data from body data, so it can be indexed properly.
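The sketch below shows one way an indexer could take advantage of that markup: it keeps only the text that sits outside header, nav, and footer elements. The page is a made-up sample and MainContentParser is a hypothetical name; real crawlers apply their own, more elaborate rules.

```python
from html.parser import HTMLParser


class MainContentParser(HTMLParser):
    """Collect page text while skipping header, nav, and footer elements."""

    SKIPPED_TAGS = {"header", "nav", "footer"}

    def __init__(self):
        super().__init__()
        self.skip_depth = 0  # how many skipped elements we are currently inside
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.skip_depth == 0:
            self.chunks.append(text)


# Made-up sample page
page = """
<body>
<header><nav><a href="/">Home</a> <a href="/blog">Blog</a></nav></header>
<main><h1>Choosing a portable speaker</h1><p>Battery life matters most.</p></main>
<footer>Copyright Example Inc.</footer>
</body>
"""

content_parser = MainContentParser()
content_parser.feed(page)
body_text = " ".join(content_parser.chunks)
print(body_text)  # Choosing a portable speaker Battery life matters most.
```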

5. Augmenting your search index

Search indexes can be enriched with data in a variety of ways, such as:

  • Adding color metadata via the Google Vision API
  • Using third-party data such as product ratings
  • Extracting incoming data to be used for creating filters and facets

Data can be enhanced as new information is added to the index. Search engines use this enriched data to provide better results and make it simpler for consumers to find what they are looking for. Ecommerce sites update their items frequently, and the enriched data can be incorporated during those updates.
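As a simple illustration of enrichment at update time, this sketch merges third-party ratings into product records and derives a price_range field that could later back a facet. All field names, price bands, and sample data here are assumptions chosen for the example.

```python
def price_range(price):
    """Bucket a price into a coarse band suitable for a facet (bands are arbitrary)."""
    if price < 50:
        return "under 50"
    if price < 100:
        return "50 to 100"
    return "100 and over"


def enrich(record, ratings):
    """Return a copy of the record with a rating (if known) and a price band."""
    enriched = dict(record)
    rating = ratings.get(record["sku"])
    if rating is not None:
        enriched["rating"] = rating
    enriched["price_range"] = price_range(record["price"])
    return enriched


# Hypothetical catalog and third-party ratings feed
catalog = [
    {"sku": "SPK-1", "name": "Portable speaker", "price": 79.0},
    {"sku": "CBL-2", "name": "USB-C cable", "price": 12.5},
]
third_party_ratings = {"SPK-1": 4.6}

enriched_catalog = [enrich(record, third_party_ratings) for record in catalog]
print(enriched_catalog)
```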

6. Business performance data

Your index is more than your content. Off-site data, such as product ratings, margins, inventory levels, etc., can be very useful for a search index to assist with result ranking. There may be many products which are relevant to a customer searching your site, but your business data can be used to enhance results to ensure the best ones are pushed to the top. We offer custom ranking and boosting that help customers build conversion flywheels using this kind of business data.
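To give a feel for how business data can reorder equally relevant results, here is a generic sketch that sorts products by stock status, then rating, then margin. It is an assumption-laden illustration of the idea and not our actual ranking formula; the fields and the order of criteria are invented.

```python
# Hypothetical products that are all equally relevant to the query
products = [
    {"name": "Speaker A", "in_stock": True, "rating": 4.6, "margin": 0.30},
    {"name": "Speaker B", "in_stock": False, "rating": 4.9, "margin": 0.45},
    {"name": "Speaker C", "in_stock": True, "rating": 4.6, "margin": 0.40},
]


def business_rank(product):
    """Sort key: in-stock items first, then higher rating, then higher margin."""
    return (product["in_stock"], product["rating"], product["margin"])


ranked = sorted(products, key=business_rank, reverse=True)
print([product["name"] for product in ranked])
# ['Speaker C', 'Speaker A', 'Speaker B']
```

Each criterion only breaks ties left by the one before it, so an out-of-stock item never outranks an available one, however well rated it is.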

7. Merchandising and campaign data

Many retailers run quarterly, seasonal, or holiday sales. By adding merchandising and campaign data to your site index, you can adjust results to display sale items.

You could add a specific sales field or use a discount field to determine when there is a sale. In the latter case, the search engine will know that your display price is lower than your regular price, which can be helpful for sorting by discount to help visitors find the best savings. You can also then use an algorithm (via our ranking formula) to give different items a boost based on their sale status or other properties.
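The discount calculation described above can be sketched in a few lines. The field names (regular_price, display_price, on_sale, discount_percent) are assumptions for illustration; use whatever your own records call them.

```python
def add_sale_fields(record):
    """Return a copy of the record with on_sale and discount_percent fields."""
    regular = record["regular_price"]
    display = record["display_price"]
    updated = dict(record)
    updated["on_sale"] = display < regular
    if display < regular:
        updated["discount_percent"] = round((regular - display) / regular * 100)
    else:
        updated["discount_percent"] = 0
    return updated


# Hypothetical products
products = [
    {"name": "Speaker A", "regular_price": 80.0, "display_price": 60.0},
    {"name": "Speaker B", "regular_price": 120.0, "display_price": 120.0},
    {"name": "Speaker C", "regular_price": 200.0, "display_price": 120.0},
]

with_sales = [add_sale_fields(product) for product in products]

# Sort so that the biggest savings come first
best_savings = sorted(with_sales, key=lambda p: p["discount_percent"], reverse=True)
print([(p["name"], p["discount_percent"]) for p in best_savings])
# [('Speaker C', 40), ('Speaker A', 25), ('Speaker B', 0)]
```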

[Image: merchandising]

The search index should include fields and data that can be used for building filters and facets

8. Filters

Search filters and facets can be built using your search index. We can infer and create filters automatically (with Query Categorization, for instance), but you can also design custom filters when needed. Determining the best filters to offer comes down to understanding your customers and how they want to slice and dice your products. Check out our guide on filters and facets for more.
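As a minimal sketch of how facets and filters relate to the fields in your records, the code below counts the values of an attribute (the numbers you typically see next to each facet option) and narrows records by exact attribute values. The records and the helper names facet_counts and apply_filters are hypothetical.

```python
from collections import Counter


def facet_counts(records, attribute):
    """Count how many records carry each value of the given attribute."""
    return Counter(record[attribute] for record in records if attribute in record)


def apply_filters(records, **filters):
    """Keep only the records whose attributes match every requested value."""
    return [
        record
        for record in records
        if all(record.get(key) == value for key, value in filters.items())
    ]


# Hypothetical product records
records = [
    {"name": "Speaker A", "brand": "Acme", "color": "black"},
    {"name": "Speaker B", "brand": "Acme", "color": "red"},
    {"name": "Speaker C", "brand": "Sonix", "color": "black"},
    {"name": "Headphones D", "brand": "Sonix", "color": "black"},
]

print(facet_counts(records, "color"))
filtered = apply_filters(records, brand="Sonix", color="black")
print([record["name"] for record in filtered])
```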

9. Content type

There are different meta tags available to help a search index understand content by type. Is the content going to take visitors to a video, a document, a page, or something else? Use HTML or JSON-LD tags to identify your content as a video, audio, abstract, etc., to help your search index sort or filter content by type.

10. Personalization

Customers expect and want search results to be personalized. If you offer free shipping for members, that information should be in the data. If there’s a discount by location, then you’ll want to have geo data in your records, too. By connecting your search index to this data, you can easily personalize search results.

11. Integration with other third-party systems

Big businesses often have complex infrastructure with data coming from various systems. Need to integrate data with your supply chain management or PIM? You’ll want your search solution to support an API to enable instant indexing of data between systems.

12. Review your analytics and search metrics

Site owners should plan to spend some amount of time reviewing their analytics and search metrics to identify the keywords customers are querying. Understanding how customers search can help identify opportunities to enrich the index, add or adjust filters, and improve search engine results.
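If your search logs are available as raw data, even a short script can surface the most popular queries and the ones that return nothing. The log format below (a query string plus a hit count) is an assumption for illustration; adapt it to whatever your analytics export provides.

```python
from collections import Counter


def normalize(query):
    """Lowercase a query and collapse extra whitespace so variants group together."""
    return " ".join(query.lower().split())


# Hypothetical search log entries
search_log = [
    {"query": "portable speaker", "hits": 12},
    {"query": "Portable Speaker", "hits": 12},
    {"query": "portable  speaker ", "hits": 12},
    {"query": "handheld radio", "hits": 0},
    {"query": "handheld radio", "hits": 0},
    {"query": "usb-c cable", "hits": 5},
]

query_counts = Counter(normalize(entry["query"]) for entry in search_log)
top_queries = query_counts.most_common(2)

# Queries with no results are candidates for synonyms or new content
zero_result_queries = Counter(
    normalize(entry["query"]) for entry in search_log if entry["hits"] == 0
)

print(top_queries)
print(zero_result_queries)
```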

It’s all about indexing great content

Building a rich search index can greatly improve search performance and customer satisfaction. By understanding the different types of data that can be included, site owners can make sure they are providing the best possible search experience for their customers.

To learn more about how to set up your search index or take advantage of our personalization and custom ranking features, contact us today! We offer a free trial and demo so you can explore all that our solution has to offer.

About the author
Jon Silvers

Director, Digital Marketing
