Before when working with the Sources module, advanced setting were available that allowed you to alter the scrape parameters for the selected source, and add a list of search and replace commands on top of the content you discovered. We took the ladder(String Searching and Replacement commands) and changed it to Regex Search and Replace commands, so you now have the capability to perform even more complex content manipulations on your content. I’ll admit, regex(regular expression) commands are complicated. But sometimes they are the only solution to cleaning out unwanted elements in your scraped content.
A PDF on how to work with regex has been included/embedded in the relevant areas. Just as before, you will have the option to add search and replace functions directly within the article search’s advanced setting’s section, or you can add them permanently by going into your source management section and altering the parameters there.
And note:… if you have a question on how to write the regex code that will do exactly what you need… take it to StackOverflow : D . Really! they’ll know exactly what commands to run to achieve your desired results.
IndexSpy was never really meant to rest as is. Organizing data like that screams to have advanced functionality developed into it, and thats just what we have done.
You can now select pages, individually or sporadically, and export them as rss feeds. For Example: 20 pages indexed, 5 pages unindexed. Sort pages by index result, manually select the 5 unindexed articles, and export as rss. You now have a customized rss feed for rss submission.