You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fixed App#refresh_cmd() to also copy Cri's default_proc to the new Hash for the command options.
Fixed to check for non-strings for JSON & URI.
For JSON, convert StringIO to string in DictScraper.scrape().
For URL, convert URL using URI() because URI.parse() will crash with a non-string (URI object) in Scraper.open_url().
Fixed to scrape multiple HTML Ruby tag words (instead of just 1).
I thought multiple Ruby bases/texts (<rb>/<rt>) were invalid, but after running into the article below and checking the HTML with a validator, it's actually valid HTML:
- https://www3.nhk.or.jp/news/easy/k10012759201000/k10012759201000.html
- No previous articles/URLs ran into this problem (would have raised an error), so it should only be a problem with this specific, new article.