Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Medline plus in English #991

Closed
Popolechien opened this issue May 10, 2024 · 14 comments
Closed

Medline plus in English #991

Popolechien opened this issue May 10, 2024 · 14 comments
Assignees
Labels
Medical Medical related Content Zimit

Comments

@Popolechien
Copy link
Collaborator

Popolechien commented May 10, 2024

  • Website URL: https://medlineplus.gov/ency/
  • License: Public domain
  • Desired ZIM Title: Medline Plus
  • Desired ZIM Description: Trusted Health Information for You
  • Desired ZIM Icon –png (URL or attach one): n/a
  • Language (ISO 639-3): eng
  • Is this a MediaWiki?: no

See also #992

@Popolechien Popolechien added the Medical Medical related Content label May 10, 2024
@Popolechien
Copy link
Collaborator Author

Popolechien commented May 10, 2024

Notes :
Only the medical encyclopedia part is being targeted. https://medlineplus.gov/ency/ redirects to https://medlineplus.gov/encyclopedia.html so I'm not sure how that will influence things

@Popolechien
Copy link
Collaborator Author

If the scrape does not work then https://iiab.me/modules/en-medline_plus/ is a possible alternative.

@MrnateGeek
Copy link

I got the icon in 5 seconds
https://medlineplus.gov/images/touch-icon.png

@kelson42 kelson42 changed the title Medline plus Medline plus in English May 11, 2024
@kelson42
Copy link
Collaborator

@Popolechien Why only the encyclopedia part? For example https://medlineplus.gov/healthtopics.html seems of interest as well?

@Popolechien
Copy link
Collaborator Author

@kelson42 that would be #994

@benoit74 benoit74 self-assigned this Sep 20, 2024
@benoit74
Copy link
Contributor

Recipe created at https://farm.openzim.org/recipes/medlineplus.gov_en_ency ; limit set to only 100 pages for now to check website behavior (custom CSS will be needed anyway)

@benoit74
Copy link
Contributor

Discussed atm with Popolechien, we will ZIM the whole website, we do not see real arguments for ZIMing only the encyclopedia

@benoit74
Copy link
Contributor

Custom CSS developed and recipe reconfigured.

Launching with only 1000 pages for now.

Will probably have an issue with pages with videos like https://medlineplus.gov/ency/anatomyvideos/000002.htm (will probably clob the ZIM but not work at all, same issue as #323: openzim/zimit#353)

@benoit74
Copy link
Contributor

@benoit74
Copy link
Contributor

No issue found with video, file is simply an mp4 and everything is working well on test page mentioned above.

Just started the full ZIM creation in dev.

@benoit74
Copy link
Contributor

File is ready in dev: https://dev.library.kiwix.org/#lang=eng&q=medline, looks ok to me.

Please review as well before moving to prod

@Popolechien
Copy link
Collaborator Author

LGTM.

On a side note, the huge number of external links makes it all the more important to look into openzim/zimit/issues/374 and decide on blocking them or at a very minimum flag them.

@benoit74
Copy link
Contributor

benoit74 commented Oct 5, 2024

Requested in production, will update when file is ready.

@benoit74
Copy link
Contributor

benoit74 commented Oct 7, 2024

File is ready in production.

Note that on library.kiwix.org it is impacted (and Medline ES too) by kiwix/operations#280 but ZIM is OK, this is only a problem with our infra / kiwix-serve instance, not the ZIM itself, so I close the issue.

ZIM is playing fine in dev, with kiwix apple reader, and would probably play fine on a hotspot.

@benoit74 benoit74 closed this as completed Oct 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Medical Medical related Content Zimit
Projects
None yet
Development

No branches or pull requests

4 participants