Limitations of Our Premium Full Text RSS Feeds Service
Our premium Full Text service, available via our Full Text RSS Feeds add-on, is not a perfect solution.
It has its limitations due to various factors. Below is a brief explanation of these possible limits.
1. Not importing all the content from the original post
While in some cases the Full Text algorithm will not produce the desired results, this is not a bug, but instead a result of the lack of technological means.
The Full Text algorithm works by going to each individual item’s page and reading that page in order to determine the actual full content of the item, and some other data about it, such as the featured image. The software behind this service does not see what humans see, but instead, it sees HTML markup, and from its structure attempts to determine what is what. From this content, the service is able to determine if the content is to be imported and parsed as new post content.
However, in some cases, it is not possible due to the wrong structurization of the page’s markup and lack of semantic connection between elements of the page that identify it as an “article”. You can read more about how the Full Text service works here.
2. Trying to import too many feed items via the Full Text service
When it comes to certain very large feeds (around 100 feed items or more), the Full Text script might not be able to import the full list of items due to the limitations of this script.
The Full Text RSS Feeds script is not designed to process very large numbers of items. It might work for some sites, but it causes a lot of strain on the server since it will be pulling in all the posts, storing them in memory, and extracting their contents, all within the span of one request.
A common way to work around this limitation is to stagger the Feed fetch schedule for the various Feed Sources, such that the process does not strain the service.
3. Failing to import content due to site inaccessibility
Since the service has to fetch the content from the original site, some sites have added limitations that prevent our service from reading this content such as token authorization. For such sites, the only available option would be to fetch the content available within the RSS feed.
4. Failing to import feature image from the first image on the page
It's common for websites to add the featured image as the first image on a page but outside the post's content. In this case, if the image is not included in the RSS feed, the service is unable to import the image. The Full-Text RSS service is designed to read the content from the post. It's not able to load the posts information that's added outside the post content.