How does Marfeel detect the Title of an article

Marfeel automatically extracts the title of an article following this waterfall of heuristics:

  1. JSON+LD. More information
    <script type="application/ld+json"> { "@context": "https://schema.org", "@type": "NewsArticle", "headline": "Title of a News Article" } </script>
  2. Microdata
    <div itemprop="headline">Title of News Article</div>
  3. <title> meta tag. This might not be an ideal fallback since most sites add the name of the site on the their title.

Posted

in