A crawler to find websites that exercise code in Firefox that is not covered by unit tests
Updated 2023-11-01 20:49:39 +03:00
Crawl a website and run it through Google Lighthouse
Updated 2022-04-27 21:45:13 +03:00
A crawler that uses OpenWPM.
Updated 2021-12-26 23:12:55 +03:00
An automated and scalable approach to generating tasklets from a natural language task query and a website URL. Glider does not require any pre-training. Glider models tasklet extraction as a state space search, where agents explore a website’s UI and are rewarded for making progress towards task completion. The reward is computed from the agent’s navigation pattern and the similarity between its trajectory and the task query.
Updated 2021-09-03 06:52:47 +03:00
Crawler dashboard for controlling the ghcrawler application.
Updated 2020-10-20 00:53:30 +03:00
Crawl GitHub APIs and store the discovered orgs, repos, commits, ...
Updated 2020-09-04 18:08:29 +03:00
This script is used within our Bing and Interop crawlers to determine the properties used on a page and generalized values that could have been used.
Updated 2018-08-08 03:37:34 +03:00
This is where we will store our crawler-specific code.
Updated 2017-11-29 04:03:23 +03:00
A simple command-line app for controlling a GitHub crawler
Updated 2017-09-08 22:25:13 +03:00
Data Lake ETL Code for GHCrawler
Updated 2017-08-23 23:37:11 +03:00
INACTIVE - http://mzl.la/ghe-archive - Progressive Web App Crawler
Updated 2016-04-06 13:50:43 +03:00