E-mails, subdomains and names Harvester - OSINT
Find a file
dependabot[bot] 578a08f144
chore(deps-dev): bump types-requests from 2.28.7 to 2.28.8
Bumps [types-requests](https://github.com/python/typeshed) from 2.28.7 to 2.28.8.
- [Release notes](https://github.com/python/typeshed/releases)
- [Commits](https://github.com/python/typeshed/commits)

---
updated-dependencies:
- dependency-name: types-requests
  dependency-type: direct:development
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-08-05 21:02:46 +00:00
.github A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
bin Fix bug from stopping theHarvester.py from running and update the binary to matach the new theHarvester.py 2021-12-01 19:52:26 +00:00
README Update CONTRIBUTING.md 2020-04-02 10:06:59 +02:00
requirements chore(deps-dev): bump types-requests from 2.28.7 to 2.28.8 2022-08-05 21:02:46 +00:00
tests A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
theHarvester A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
wordlists Merge pull request #821 from Sbajary/Adding-Dorks 2021-08-05 14:01:44 -04:00
.dockerignore fix docker file and move it to use ubuntu and add docker ignore file 2021-06-28 21:37:47 +01:00
.flake8 rename setup.cfg to be .flake8 file 2022-06-09 23:41:10 +00:00
.gitattributes Removed google-profiles and clean up. 2019-02-13 23:05:52 -06:00
.gitignore Updated zoomeye module, updated user agents list, fixed substring not found, and replaced orjson in favor of ujson. (#923) 2021-11-22 09:26:49 +00:00
.lgtm.yml fix api_example issue and update lgtm.yml file 2021-07-01 22:48:08 +01:00
api-keys.yaml A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
Dockerfile A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
mypy.ini namespace_packages = True to mypy.ini 2021-08-02 16:58:58 +01:00
proxies.yaml Removed https proxies as aiohttp only officially sports http proxies. 2020-02-07 13:12:11 -05:00
pyproject.toml actually fix pyest config file 2022-06-10 00:19:46 +00:00
pytest.ini A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
README.md A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
requirements.txt added requirements.txt 2019-12-31 13:23:32 -05:00
restfulHarvest.py add auto reload support to the restapi launcher 2021-05-12 01:40:44 +01:00
setup.cfg A lot of changes and fixes (#1168) 2022-08-03 23:12:59 +01:00
setup.py Add api restfulHarvest script to setup.py and fix a bug in the api to handle that 2021-06-28 01:02:36 +01:00
theHarvester-logo.png Update theHarvester-logo.png 2019-09-10 23:14:45 +02:00
theHarvester.py Fix bug from stopping theHarvester.py from running and update the binary to matach the new theHarvester.py 2021-12-01 19:52:26 +00:00

theHarvester

TheHarvester CI TheHarvester Docker Image CI Language grade: Python Rawsec's CyberSecurity Inventory

What is this?

theHarvester is a very simple to use, yet powerful and effective tool designed to be used in the early stages of a
penetration test or red team engagement. Use it for open source intelligence (OSINT) gathering to help determine a
company's external threat landscape on the internet. The tool gathers emails, names, subdomains, IPs and URLs using
multiple public data sources that include:

Passive:

  • anubis: Anubis-DB - https://github.com/jonluca/anubis

  • baidu: Baidu search engine - www.baidu.com

  • binaryedge: List of known subdomains from www.binaryedge.io

  • bing: Microsoft search engine - www.bing.com

  • bingapi: Microsoft search engine, through the API (Requires an API key, see below.)

  • bufferoverun: Uses data from Rapid7's Project Sonar - www.rapid7.com/research/project-sonar/

  • censys: Censys search engine, will use certificates searches to enumerate subdomains and gather emails (Requires an API key, see below.) - censys.io

  • certspotter: Cert Spotter monitors Certificate Transparency logs - https://sslmate.com/certspotter/

  • crtsh: Comodo Certificate search - https://crt.sh

  • dnsdumpster: DNSdumpster search engine - https://dnsdumpster.com

  • duckduckgo: DuckDuckGo search engine - www.duckduckgo.com

  • fullhunt: The Next-Generation Attack Surface Security Platform - https://fullhunt.io

  • github-code: GitHub code search engine (Requires a GitHub Personal Access Token, see below.) - www.github.com

  • hackertarget: Online vulnerability scanners and network intelligence to help organizations - https://hackertarget.com

  • hunter: Hunter search engine (Requires an API key, see below.) - www.hunter.io

  • intelx: Intelx search engine (Requires an API key, see below.) - www.intelx.io

  • omnisint: Project Crobat, A Centralised Searchable Open Source Project Sonar DNS Database - https://github.com/Cgboal/SonarSearch

  • otx: AlienVault Open Threat Exchange - https://otx.alienvault.com

  • pentesttools: Powerful Penetration Testing Tools, Easy to Use (Requires an API key, see below.) - https://pentest-tools.com/home

  • projecdiscovery: We actively collect and maintain internet-wide assets data, to enhance research and analyse changes around DNS for better insights (Requires an API key, see below.) - https://chaos.projectdiscovery.io

  • qwant: Qwant search engine - www.qwant.com

  • rapiddns: DNS query tool which make querying subdomains or sites of a same IP easy! https://rapiddns.io

  • rocketreach: Access real-time verified personal/professional emails, phone numbers, and social media links. - https://rocketreach.co

  • securityTrails: Security Trails search engine, the world's largest repository of historical DNS data
    (Requires an API key, see below.) - www.securitytrails.com

  • shodan: Shodan search engine, will search for ports and banners from discovered hosts (Requires an API key, see below.) - www.shodanhq.com

  • spyse: Spyse is a search engine built for a quick cyber intelligence of IT infrastructures, networks, and even the smallest parts of the internet. (Requires an API key, see below.) - spyse.com

  • sublist3r: Fast subdomains enumeration tool for penetration testers - https://api.sublist3r.com/search.php?domain=example.com

  • threatcrowd: Open source threat intelligence - www.threatcrowd.org

  • threatminer: Data mining for threat intelligence - https://www.threatminer.org/

  • urlscan: A sandbox for the web that is a URL and website scanner - https://urlscan.io

  • vhost: Bing virtual hosts search

  • virustotal: virustotal.com domain search

  • yahoo: Yahoo search engine

  • zoomeye: China version of shodan - https://www.zoomeye.org

Active:

  • DNS brute force: dictionary brute force enumeration
  • Screenshots: Take screenshots of subdomains that were found

Modules that require an API key:

Documentation to setup API keys can be found at - https://github.com/laramies/theHarvester/wiki/Installation#api-keys

  • binaryedge - not free
  • bing
  • censys - API keys are required and can be retrieved from your Censys account.
  • fullhunt
  • github
  • hunter - limited to 10 on the free plan, so you will need to do -l 10 switch
  • intelx
  • pentesttools - not free
  • projecdiscovery - invite only for now
  • rocketreach - not free
  • securityTrails
  • shodan
  • spyse - not free
  • zoomeye

Install and dependencies:

Comments, bugs, and requests:

Main contributors:

  • Twitter Follow Matthew Brown @NotoriousRebel1
  • Twitter Follow Jay "L1ghtn1ng" Townsend @jay_townsend1
  • Twitter Follow Lee Baird @discoverscripts

Thanks:

  • John Matherly - Shodan project
  • Ahmed Aboul Ela - subdomain names dictionaries (big and small)