I use FreshRSS XPath scraping to create feeds for upcoming events from the websites of local venues in my city. This worked very well until recently, when one of the venues changed the output of their website's event calendar from HTML to JavaScript.

Unfortunately, the entire script field is too long to post here, but the event data I'm trying to grab is buried within <script type="text/javascript">. All events are nested under n.performances in the following format. I'm trying to grab the title, imageUrl, location, moreDetailsUrl, and performanceDate fields from each performance listed in the script.

I tried configuring the feed source as HTML + XPath + JSON and pointing it at //script[@type="text/javascript"], but I'm unsure how to configure the remaining fields to pull the event data I need. Nothing I've tried works, and the feed is not pulling any events. The FreshRSS log output isn't giving me much information beyond telling me that the parsing failed.

Is it possible for FreshRSS to scrape the event data from this site's calendar, and if so, could someone point me in the right direction?
Hello,
@dahlbergc See #7369. Tests welcome. The parsing failed because the pages you are trying to process contain multiple JSON fragments, which was not supported until now; I have just implemented it.
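
For anyone who wants to check a page outside FreshRSS before testing, here is a minimal Python sketch of the "multiple JSON fragments" situation. It is not FreshRSS code and does not reflect how #7369 is implemented. The sample page, the `ScriptCollector` and `find_events` names, and the field values are all made up for illustration; only the field names come from the question. It also assumes the embedded data is strict JSON, which real JavaScript object literals often are not.

```python
import json
from html.parser import HTMLParser

# Hypothetical page: one script holding two separate JSON fragments.
SAMPLE_HTML = """
<html><body>
<script type="text/javascript">
var settings = {"theme": "dark"};
n.performances = [{"title": "Example Show",
  "imageUrl": "https://example.com/a.jpg",
  "location": "Main Hall",
  "moreDetailsUrl": "https://example.com/show",
  "performanceDate": "2025-03-01T19:30:00"}];
</script>
</body></html>
"""


class ScriptCollector(HTMLParser):
    """Collect the text of every <script type="text/javascript"> element."""

    def __init__(self):
        super().__init__()
        self.scripts = []
        self._in_script = False

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "text/javascript":
            self._in_script = True
            self.scripts.append("")

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_script = False

    def handle_data(self, data):
        if self._in_script:
            self.scripts[-1] += data


def json_fragments(text):
    """Yield each JSON object or array that decodes cleanly from text."""
    decoder = json.JSONDecoder()
    pos = 0
    while pos < len(text):
        if text[pos] in "{[":
            try:
                value, end = decoder.raw_decode(text, pos)
            except json.JSONDecodeError:
                pos += 1  # not valid JSON here, keep scanning
                continue
            yield value
            pos = end  # skip past the fragment just decoded
        else:
            pos += 1


def find_events(html):
    """Return dicts that look like performances (have performanceDate)."""
    parser = ScriptCollector()
    parser.feed(html)
    parser.close()
    events = []
    for script in parser.scripts:
        for fragment in json_fragments(script):
            if isinstance(fragment, list):
                events.extend(
                    item for item in fragment
                    if isinstance(item, dict) and "performanceDate" in item
                )
    return events


events = find_events(SAMPLE_HTML)
for event in events:
    print(event["title"], event["performanceDate"], event["moreDetailsUrl"])
```

A parser that expects the whole script body to be a single JSON document fails on a page like this, because the script contains JavaScript statements wrapped around more than one fragment. Scanning for each decodable fragment, as above, is one way to confirm that the data is present and well-formed before configuring the feed.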