Homepage

v1.0.0-beta5

  • The extraction API has been revamped

    • extract() will now return just the extracted values instead of wrapper objects. Learn more in the updated documentation
    • HTML attributes and data-attributes are now supported
    • Multiple extractions in the same extract() call and nested extractions have been removed
    • New methods extractFirst() and extractLast() can be used to extract a single value of the first or the last match respectively
    • New method join() can be used to properly group multiple sets of related data
    • Inferred return types have been greatly improved
  • The builtin scripts have been updated to reflect the new extraction semantics
  • Primitive values can now be yielded as output from scrapers or scripts without the need to be wrapped in objects
  • Random protocol and bridge ports can be used by setting them to 0 in the config
  • Multiple projects can now be run at the same time without interfering with each other
  • Multiple resumable sessions of the same project can exist and run (even at the same time) by using the --sessionKey CLI argument

Update to the latest version by running:

npm update -g ayakashi

A new @ayakashi/types package is also available. It can be updated in existing projects by running npm --save-dev @ayakashi/types

Update to the latest version by running:

npm update -g ayakashi

A new @ayakashi/types package is also available. It can be updated in existing projects by running npm update.

Update to the latest version by running:

npm update -g ayakashi
  • Each scrapper invocation now runs in its own browser context (like an incognito window) unless persistentSession is on or not running in headless mode.
  • Add support for renderless scrappers. Learn more here: https://ayakashi.io/docs/guide/renderless-scrappers.html
  • Add support for API scrappers. Learn more here: https://ayakashi.io/docs/guide/api-scrappers.html
  • saveToSQL script will now re-use the database connection and won't create a new one each time it is run.
  • Add recursiveYield() and recursiveYieldEach() methods, which allow to recursively re-run the scrapper by yielding the extracted data to itself.
  • domQL tagName queries will now accept both lowercase and uppercase tag names.

Update to the latest version by running:

npm update -g ayakashi

A new @ayakashi/types package is also available. It can be updated in existing projects by running npm update.

  • use a new, more efficient chromium downloader
  • start using a global store folder
  • use a recommended chromium revision instead of downloading the latest one
  • chromium will now be downloaded only once in the global store folder (instead of a local .chromium directory)
  • local .cache and .session directories are also moved to the global store
  • show a notification if there is a new update or chromium revision

Update to the latest version by running:

npm update -g ayakashi

The first major beta release 🎉

Changes

Upgrade to the latest version by running:

npm install -g ayakashi

Another stability focused release before moving to beta 2.

  • fix process buffer overflow
  • fix long-running event memory leak
  • small fixes and improvements

Upgrade to the latest version by running:

npm install -g ayakashi

Fix memory leak caused by zombie connections

Includes some extra logging to help debugging