E-mails, subdomains and names Harvester - OSINT

theHarvester

What is this?

theHarvester is a simple to use, yet powerful tool designed for the early stages of a penetration test or red team
engagement. Use it for open source intelligence (OSINT) gathering to help determine a company's external threat
landscape on the internet. The tool gathers emails, names, subdomains, IPs and URLs from multiple public data
sources, including:

Passive:

  • anubis: Anubis-DB - https://github.com/jonluca/anubis

  • baidu: Baidu search engine - www.baidu.com

  • binaryedge: placeholder - www.binaryedge.io

  • bing: Microsoft search engine - www.bing.com

  • bingapi: Microsoft search engine, through the API (Requires an API key, see below.)

  • bufferoverun: Uses data from Rapid7's Project Sonar - www.rapid7.com/research/project-sonar/

  • censys: Censys search engine, uses certificate searches to enumerate subdomains and gather emails (Requires an API key, see below.) - censys.io

  • certspotter: Cert Spotter monitors Certificate Transparency logs - https://sslmate.com/certspotter/

  • crtsh: Comodo Certificate search - https://crt.sh

  • dnsdumpster: DNSdumpster search engine - https://dnsdumpster.com

  • duckduckgo: DuckDuckGo search engine - www.duckduckgo.com

  • fullhunt: The Next-Generation Attack Surface Security Platform - https://fullhunt.io

  • github-code: GitHub code search engine (Requires a GitHub Personal Access Token, see below.) - www.github.com

  • google: Google search engine (Optional Google dorking.) - www.google.com

  • hackertarget: Online vulnerability scanners and network intelligence to help organizations - https://hackertarget.com

  • hunter: Hunter search engine (Requires an API key, see below.) - www.hunter.io

  • intelx: Intelx search engine (Requires an API key, see below.) - www.intelx.io

  • linkedin: Google search engine, specific search for LinkedIn users - www.linkedin.com

  • linkedin_links: specific search for LinkedIn users for target domain (Uses Google search.)

  • n45ht: - https://n45ht.or.id

  • omnisint: Project Crobat, A Centralised Searchable Open Source Project Sonar DNS Database - https://github.com/Cgboal/SonarSearch

  • otx: AlienVault Open Threat Exchange - https://otx.alienvault.com

  • pentesttools: Powerful Penetration Testing Tools, Easy to Use (Requires an API key, see below.) - https://pentest-tools.com/home

  • projectdiscovery: Actively collects and maintains internet-wide asset data to enhance research and analyse changes around DNS for better insights (Requires an API key, see below.) - https://chaos.projectdiscovery.io

  • qwant: Qwant search engine - www.qwant.com

  • rapiddns: DNS query tool that makes it easy to query subdomains or sites on the same IP - https://rapiddns.io

  • rocketreach: Access real-time verified personal/professional emails, phone numbers, and social media links. - https://rocketreach.co

  • securityTrails: Security Trails search engine, the world's largest repository of historical DNS data
    (Requires an API key, see below.) - www.securitytrails.com

  • shodan: Shodan search engine, searches for ports and banners of discovered hosts (Requires an API key, see below.) - www.shodanhq.com

  • spyse: Spyse is a search engine built for quick cyber intelligence on IT infrastructures, networks, and even the smallest parts of the internet (Requires an API key, see below.) - spyse.com

  • sublist3r: Fast subdomains enumeration tool for penetration testers - https://api.sublist3r.com/search.php?domain=example.com

  • threatcrowd: Open source threat intelligence - www.threatcrowd.org

  • threatminer: Data mining for threat intelligence - https://www.threatminer.org/

  • trello: Search trello boards (Uses Google search.)

  • twitter: Twitter accounts related to a specific domain (Uses Google search.)

  • urlscan: A sandbox for the web that is a URL and website scanner - https://urlscan.io

  • vhost: Bing virtual hosts search

  • virustotal: virustotal.com domain search

  • yahoo: Yahoo search engine

  • zoomeye: Chinese counterpart to Shodan (Requires an API key, see below.) - https://www.zoomeye.org

Active:

  • DNS brute force: dictionary brute force enumeration
  • Screenshots: Take screenshots of subdomains that were found
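
The dictionary brute force idea above can be illustrated with a minimal sketch. This is not theHarvester's own implementation: the function names `resolve` and `brute_force_subdomains` are hypothetical, and the sketch assumes a plain blocking lookup through the standard library rather than whatever resolver the tool actually uses. Each word from a wordlist is tried as a subdomain label, and only the names that resolve are kept.

```python
import socket
from typing import Callable, Dict, Iterable, Optional


def resolve(hostname: str) -> Optional[str]:
    """Return an IPv4 address for hostname, or None if it does not resolve."""
    try:
        return socket.gethostbyname(hostname)
    except (OSError, UnicodeError):
        # socket.gaierror is an OSError; UnicodeError covers invalid labels.
        return None


def brute_force_subdomains(
    domain: str,
    words: Iterable[str],
    resolver: Callable[[str], Optional[str]] = resolve,
) -> Dict[str, str]:
    """Try each word as a subdomain label and keep the names that resolve."""
    found: Dict[str, str] = {}
    for word in words:
        label = word.strip().lower()
        if not label or label.startswith("#"):
            continue  # skip blank lines and comment lines in the wordlist
        candidate = f"{label}.{domain}"
        address = resolver(candidate)
        if address is not None:
            found[candidate] = address
    return found


if __name__ == "__main__":
    # Offline demonstration: a fake lookup table stands in for real DNS.
    fake_dns = {"www.example.com": "192.0.2.10"}
    print(brute_force_subdomains("example.com", ["www", "dev"], fake_dns.get))
```

Passing the resolver as a parameter keeps the loop testable without network access. A real run would pass lines from one of the files in the wordlists directory and would likely need concurrency and rate limiting, which this sketch leaves out.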

Modules that require an API key:

Documentation for setting up API keys can be found at https://github.com/laramies/theHarvester/wiki/Installation#api-keys

  • binaryedge - not free
  • bing
  • censys - API keys are required and can be retrieved from your Censys account.
  • fullhunt
  • github
  • hunter - limited to 10 results on the free plan, so you will need to pass the -l 10 switch
  • intelx
  • pentesttools - not free
  • projectdiscovery - invite only for now
  • rocketreach - not free
  • securityTrails
  • shodan
  • spyse - not free
  • zoomeye
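
The hunter note above (a cap of 10 on the free plan, hence -l 10) can be handled with a small wrapper when scripting runs. The sketch below is illustrative only: `FREE_PLAN_LIMITS`, `effective_limit` and `build_command` are hypothetical names, and it assumes that the executable is called `theHarvester` and that -d (domain) and -b (source) are its usual switches. Only -l is mentioned in the text above, so check the tool's --help output before relying on the others.

```python
from typing import List

# Per-source result caps. The only cap stated above is hunter's free plan.
FREE_PLAN_LIMITS = {"hunter": 10}


def effective_limit(source: str, requested: int) -> int:
    """Clamp the requested result limit to the source's free-plan cap, if any."""
    cap = FREE_PLAN_LIMITS.get(source.lower())
    return requested if cap is None else min(requested, cap)


def build_command(domain: str, source: str, requested_limit: int) -> List[str]:
    """Build an argument list for a single-source run without executing it.

    Assumption: -d and -b are the domain and source switches; confirm with
    the tool's --help output.
    """
    limit = effective_limit(source, requested_limit)
    return ["theHarvester", "-d", domain, "-b", source, "-l", str(limit)]


if __name__ == "__main__":
    # Print the command instead of running it, so nothing is queried here.
    print(" ".join(build_command("example.com", "hunter", 500)))
```

The resulting list can be handed to `subprocess.run` once the switches are confirmed. Keeping the caps in a dictionary makes it easy to add other plan limits later.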

Install and dependencies:

Comments, bugs, and requests:

Main contributors:

  • Matthew Brown @NotoriousRebel1
  • Jay "L1ghtn1ng" Townsend @jay_townsend1
  • Lee Baird @discoverscripts

Thanks:

  • John Matherly - Shodan project
  • Ahmed Aboul Ela - subdomain names dictionaries (big and small)