Skip to content

pavel-karatsiuba/phet

 
 

Repository files navigation

Get it on Google Play

PhET Simulations scraper

This scraper creates offline versions in ZIM format of PhET science simulations in the ZIM format.

Requirements

It requires Node.js v10 or higher.

Quick Start

npm i && npm start

The above will eventually output a ZIM file to dist/

Build Status ![Docker Build Status](Docker Image Version (latest semver)) latest ZIM releases CodeFactor

Command line arguments

Available on GET and EXPORT steps only:

    --includeLanguages lang_1 [lang_2] [lang_3] ...
    --excludeLanguages lang_1 [lang_2] [lang_3] ...

Available on EXPORT step only:

    # skip ZIM files for individual languages
    --mulOnly

Example: npm run get -- --includeLanguages en ru fr

Config

Another way to configure behaviour is through environment variables. Sample .env file (with default values):

# request per second, affects GET step only
PHET_RPS=8
# async workers on TRANSFORM step (keep it equal to number of CPU cores)
PHET_WORKERS=10
# number of retries on GET step (delay grow with exponential backoff)
PHET_RETRIES=5
# display verbose errors
PHET_VERBOSE_ERRORS=false

About

This project achieves multiple things:

  • Download PhET content
  • Generate an Index for said content
  • Generate a ZIM file containing content and index

Things this project does not yet do, but should:

  • Generate Android APK

Usage

The functionality is split into 5 npm scripts:

  • npm run setup - deletes state from previous runs
  • npm run get - downloads PhET simulations in specified languages
  • npm run transform - prepare the content and media files
  • npm run export - generates ZIM file(s)
  • npm start - runs all of the above in sequence

The steps get, transform and export have their own output directories:

  • get outputs HTML and PNG files to state/get
  • transform outputs intermediate files to state/transform
  • export outputs HTML and PNG files to state/export AND a ZIM file(s) to dist/

About

A scraper and exporter for PhET content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 91.4%
  • TypeScript 7.3%
  • Other 1.3%