Environment: All
Hawksearch Versions: V4.0 and V4.0L
The Facets Followed by Spiders section of Hawksearch SEO Influencer allows you to specify which facets should be crawled by spiders and the depth of the facets (only for nested scenarios) that should be crawled.
Before this release, our SEO Influencer tool did not limit how many facet values per facet were considered for sitemap creation. This was not an issue because most of our customers don’t have long lists of facet values. However, one of our newer clients, PCAOB, has thousands of facet values, and the sitemap creation process for this customer hangs.
With the February 2022 release, we have made the following changes:
1. UI changes:
Added a Max # of Values per Facet column that limits the number of facet values from each facet used for sitemap building.
Please note: this column is not editable by clients; it is read-only (display only). A system parameter was also created to control the limit internally.
2. Changes that Impact Different Facet Types:
a. Non-nested facets
For non-nested facets the max # of facet values per facet should be applied at the facet level.
E.g. if max # of facet values per facet for Color facet is 5, we will only use the first 5 facet values for sitemap building.
Let’s look at the Vendor Name facet on the Reinders page as an example:
Only the first 5 facet values will be used for sitemap building:
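The non-nested rule can be sketched in a few lines of Python. This is only an illustration, not the Hawksearch implementation: the function name and the vendor names are hypothetical, and the sketch assumes "first" means the order in which the facet values are already listed.

```python
def limit_facet_values(values, max_values):
    """Return the first `max_values` facet values, keeping their order.

    Hypothetical helper: the name and signature are assumptions made
    for illustration only.
    """
    if max_values < 0:
        raise ValueError("max_values must not be negative")
    return list(values)[:max_values]


# Hypothetical facet values for a Vendor Name facet.
vendor_names = ["Vendor A", "Vendor B", "Vendor C", "Vendor D",
                "Vendor E", "Vendor F", "Vendor G"]

# With a limit of 5, only the first five values are kept.
for_sitemap = limit_facet_values(vendor_names, 5)
print(for_sitemap)
```

A facet with fewer values than the limit is returned unchanged.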
b. Nested facets
For nested facets (both nested links and nested checkbox types) we need to account for max # of facet values per facet AND Max Depth.
E.g. if max # of facet values per facet for Color facet is 2, and max depth is set to 2, we will have to account for 2x2=4 variations that will be used for sitemap building.
Let’s look at the Reinders page as an example:
Department 0 = Category
Department/Subcategory 1 = Irrigation
Department/Subcategory 2 = Turn/Landscaping
Depth = 2 means that we will use just the Irrigation and Turn/Landscaping subcategories for sitemap building.
Max # of facet values per facet = 2 means that we will use the first two facet values from each of those subcategories:
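One way to picture how the two limits combine is the Python sketch below. It is an assumption-laden illustration, not the actual sitemap builder: the tree layout (a dict of dicts), the function name, and the color values are hypothetical. It also assumes that only the paths at the deepest allowed level are counted, which the text above does not state explicitly.

```python
def limited_paths(tree, max_values, max_depth):
    """Collect facet paths, taking at most `max_values` children per
    level and descending at most `max_depth` levels.

    `tree` maps a facet value name to its child subtree (an empty dict
    for a leaf). Hypothetical structure, for illustration only.
    """
    paths = []

    def walk(node, prefix, depth):
        # Keep only the first `max_values` children at this level.
        for name, subtree in list(node.items())[:max_values]:
            path = prefix + [name]
            if depth == max_depth or not subtree:
                paths.append(path)  # deepest allowed level, or a leaf
            else:
                walk(subtree, path, depth + 1)

    walk(tree, [], 1)
    return paths


# Hypothetical nested Color facet.
color_tree = {
    "Red": {"Dark Red": {}, "Light Red": {}, "Pink": {}},
    "Blue": {"Navy": {}, "Sky": {}},
    "Green": {"Lime": {}},
}

variations = limited_paths(color_tree, max_values=2, max_depth=2)
print(len(variations))  # 2 values x 2 values = 4 paths
```

With both limits set to 2, "Green" and "Pink" are dropped and four paths remain, matching the 2x2=4 count described above.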
c. Accounting for Dynamic SEO Elements
Our customers have an option to add Canonical URLs:
The canonical URL logic should follow the limits as well: if a facet value is not going to be included in the sitemap, the canonical URL should point back to the main landing page when that facet value is selected. The title, meta description, and meta keywords logic of the page should also honor the limits.
Example Canonical Link:
http://www.hawkstore.com/landing_page_url/?color=blue&brand=adidas
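The canonical fallback can be sketched as follows. This is a minimal illustration under stated assumptions, not the product's logic: the function name, the `selection` and `included` structures, and the rule that a single excluded value sends the canonical back to the landing page are all assumptions.

```python
from urllib.parse import urlencode


def canonical_url(landing_url, selection, included):
    """Build a canonical URL that honors the sitemap limits.

    selection: dict of facet name -> selected value.
    included:  dict of facet name -> set of values kept for the sitemap.
    Both structures are hypothetical and used for illustration only.
    """
    if not selection:
        return landing_url
    for facet, value in selection.items():
        # A value outside the sitemap limits falls back to the landing page.
        if value not in included.get(facet, set()):
            return landing_url
    return landing_url + "?" + urlencode(selection)


landing = "http://www.hawkstore.com/landing_page_url/"
included = {"color": {"blue", "red"}, "brand": {"adidas"}}

print(canonical_url(landing, {"color": "blue", "brand": "adidas"}, included))
print(canonical_url(landing, {"color": "teal"}, included))
```

The first call reproduces the example canonical link above; the second returns the landing page URL because "teal" is not among the values kept for the sitemap.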
d. Swatches, Range/Open Range, and Slider Facets
Swatch, range, and open range facets are included in sitemap building.
Slider facets are excluded from sitemap building because we don’t return facet values for sliders.
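The facet-type rule amounts to a simple filter, sketched here in Python. The type identifiers ("swatch", "range", "open_range", "slider") and the facet dictionaries are hypothetical labels chosen for illustration; the real type names may differ.

```python
# Facet types skipped during sitemap building (hypothetical identifier).
EXCLUDED_FACET_TYPES = {"slider"}


def facets_for_sitemap(facets):
    """Keep every facet whose type is not excluded from sitemap building."""
    return [f for f in facets if f["type"] not in EXCLUDED_FACET_TYPES]


# Hypothetical facet definitions.
facets = [
    {"name": "Color", "type": "swatch"},
    {"name": "Price", "type": "range"},
    {"name": "Weight", "type": "open_range"},
    {"name": "Rating", "type": "slider"},
]

kept = [f["name"] for f in facets_for_sitemap(facets)]
print(kept)
```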
e. Facet boosting and Visibility Rules
Boosting and visibility rules on a facet are not respected for sitemap building.
Documentation:
Hawksearch Version (Lucene, ES, Both)? Both
Dashboard REST API Version required? n/a
Search API version required? n/a
If Lucene what front end version does client account need to be set to for getting properties in JSON: n/a
For ES what version of API has this available: n/a
For ES only: Indexing API version required: n/a
How is this feature enabled? Through AdditionalFeatures (described below). The facet follow list must also be defined; otherwise the sitemap is not generated.
Feature Flag on System Parameters: AdditionalFeatures needs to contain 'seoImpact'
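The two preconditions can be expressed as a single check, sketched below in Python. The storage format of AdditionalFeatures is not described here, so the sketch assumes a comma-separated string and an exact, case-sensitive match on 'seoImpact'; the function name and sample values are hypothetical.

```python
def sitemap_generation_enabled(additional_features, facet_follow_list):
    """Return True when both preconditions for sitemap generation hold.

    Assumes `additional_features` is a comma-separated string; the real
    parameter format may differ.
    """
    features = {f.strip() for f in additional_features.split(",") if f.strip()}
    has_flag = "seoImpact" in features
    has_follow_list = len(facet_follow_list) > 0
    return has_flag and has_follow_list


# Hypothetical parameter values.
print(sitemap_generation_enabled("featureA, seoImpact", ["Color", "Brand"]))
print(sitemap_generation_enabled("featureA, seoImpact", []))
```

The second call returns False because the facet follow list is empty, even though the flag is present.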
Is this available to all customers? yes
Requires any customizations? If so, please detail them in the document: no
If related to Recommendations: what is the minimum Recommendations API version required? n/a
Tracking Dependencies if any and what is the tracking API version or tracking method (JS, API): n/a
Performance considerations if any to be noted: If too many facets are added to the SEO Influencer facet follow list, or the limits are set too high, sitemap generation may time out. In that case the process will be marked as 'suspended' and the sitemap won’t be generated.