E-mails, subdomains and names Harvester - OSINT

theHarvester


What is this?

theHarvester is a simple yet effective tool designed for the early stages of a penetration test or red team
engagement. Use it for open source intelligence (OSINT) gathering to help determine a company's external threat
landscape on the internet. The tool gathers emails, names, subdomains, IPs and URLs from multiple public data
sources, including:

Passive:

  • baidu: Baidu search engine - www.baidu.com

  • bing: Microsoft search engine - www.bing.com

  • bingapi: Microsoft search engine, through the API (Requires an API key, see below.)

  • bufferoverun: Uses data from Rapid7's Project Sonar - www.rapid7.com/research/project-sonar/

  • certspotter: Cert Spotter monitors Certificate Transparency logs - https://sslmate.com/certspotter/

  • crtsh: Comodo Certificate search - https://crt.sh

  • dnsdumpster: DNSdumpster search engine - https://dnsdumpster.com

  • dogpile: Dogpile search engine - www.dogpile.com

  • duckduckgo: DuckDuckGo search engine - www.duckduckgo.com

  • exalead: a meta search engine - www.exalead.com/search

  • github-code: GitHub code search engine (Requires a GitHub Personal Access Token, see below.) - www.github.com

  • google: Google search engine (Optional Google dorking.) - www.google.com

  • hackertarget: Online vulnerability scanners and network intelligence to help organizations - https://hackertarget.com

  • hunter: Hunter search engine (Requires an API key, see below.) - www.hunter.io

  • intelx: Intelx search engine (Requires an API key, see below.) - www.intelx.io

  • linkedin: Google search engine, specific search for LinkedIn users - www.linkedin.com

  • linkedin_links:

  • netcraft: Internet Security and Data Mining - www.netcraft.com

  • otx: AlienVault Open Threat Exchange - https://otx.alienvault.com

  • pentesttools: Powerful Penetration Testing Tools, Easy to Use (Needs an API key and is not free for API access) - https://pentest-tools.com/home

  • rapiddns: DNS query tool that makes it easy to query subdomains or sites on the same IP - https://rapiddns.io

  • securityTrails: Security Trails search engine, the world's largest repository of historical DNS data
    (Requires an API key, see below.) - www.securitytrails.com

  • shodan: Shodan search engine, searches for ports and banners on discovered hosts - www.shodanhq.com

  • spyse: Web research tools for professionals (Requires an API key.) - https://spyse.com

  • sublist3r: Fast subdomains enumeration tool for penetration testers - https://api.sublist3r.com/search.php?domain=example.com

  • Suip: Web research tools that can take over 10 minutes to run but are worth the wait - https://suip.biz

  • threatcrowd: Open source threat intelligence - www.threatcrowd.org

  • threatminer: Data mining for threat intelligence - https://www.threatminer.org/

  • trello: Search Trello boards (Uses Google search.)

  • twitter: Twitter accounts related to a specific domain (Uses Google search.)

  • urlscan: A sandbox for the web that is a URL and website scanner - https://urlscan.io

  • vhost: Bing virtual hosts search

  • virustotal: virustotal.com domain search

  • yahoo: Yahoo search engine

  • all:
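
Most of the passive sources above return text (search result pages, certificate records, DNS listings) that then has to be reduced to emails and hostnames for the target domain. The following is a minimal sketch of that reduction step, not theHarvester's own parser; the function names `extract_emails` and `extract_hosts` and the regular expressions are assumptions made for illustration.

```python
import re
from typing import Set

# Characters allowed in a DNS label; used to stop matches such as "example.community".
_NOT_LABEL_CHAR = r"(?![a-zA-Z0-9-])"


def extract_emails(text: str, domain: str) -> Set[str]:
    """Return lowercase e-mail addresses at `domain` or any of its subdomains."""
    pattern = re.compile(
        r"[a-zA-Z0-9._%+-]+@(?:[a-zA-Z0-9-]+\.)*" + re.escape(domain) + _NOT_LABEL_CHAR,
        re.IGNORECASE,
    )
    return {match.lower() for match in pattern.findall(text)}


def extract_hosts(text: str, domain: str) -> Set[str]:
    """Return lowercase subdomains of `domain` (at least one label in front of it)."""
    pattern = re.compile(
        r"(?:[a-zA-Z0-9-]+\.)+" + re.escape(domain) + _NOT_LABEL_CHAR,
        re.IGNORECASE,
    )
    return {match.lower() for match in pattern.findall(text)}


if __name__ == "__main__":
    sample = (
        "Contact Alice@Example.com or bob@mail.example.com; "
        "see https://www.example.com/about and dev.example.com."
    )
    print(sorted(extract_emails(sample, "example.com")))
    print(sorted(extract_hosts(sample, "example.com")))
```

Returning sets means results from several sources can be merged with a plain union, which also removes duplicates. The patterns are deliberately loose, so a real run would still need validation of what they return.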

Active:

  • DNS brute force: dictionary brute force enumeration
  • Screenshots: Take screenshots of subdomains that were found
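
Dictionary brute force enumeration means prepending each word from a wordlist to the target domain and keeping the names that resolve. A minimal sketch of the idea follows, using only the Python standard library; it is not theHarvester's implementation, and the names `resolve` and `brute_force` as well as the injectable `resolver` parameter are assumptions made for illustration.

```python
import socket
from typing import Callable, Dict, Iterable, List, Optional


def resolve(host: str) -> Optional[List[str]]:
    """Resolve `host` to IPv4 addresses with the system resolver; None on failure."""
    try:
        _, _, addresses = socket.gethostbyname_ex(host)
        return addresses
    except (OSError, UnicodeError):
        # socket.gaierror and socket.herror are subclasses of OSError.
        return None


def brute_force(
    domain: str,
    words: Iterable[str],
    resolver: Callable[[str], Optional[List[str]]] = resolve,
) -> Dict[str, List[str]]:
    """Try `<word>.<domain>` for every word and return the hosts that resolve."""
    found: Dict[str, List[str]] = {}
    for word in words:
        label = word.strip().lower()
        if not label or label.startswith("#"):
            continue  # skip blank lines and comments in the wordlist
        host = f"{label}.{domain}"
        addresses = resolver(host)
        if addresses:
            found[host] = addresses
    return found


if __name__ == "__main__":
    # This performs real DNS lookups; only run it against domains you may test.
    for host, addresses in brute_force("example.com", ["www", "mail", "dev"]).items():
        print(host, addresses)
```

The resolver is passed in so the loop can be exercised without network access. Domains with wildcard DNS records answer for every name, so results from such a domain need an extra check before they are trusted.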

Modules that require an API key:

Documentation on setting up API keys can be found at https://github.com/laramies/theHarvester/wiki/Installation#api-keys

  • bing
  • github
  • hunter
  • intelx
  • pentesttools
  • securityTrails
  • shodan
  • spyse
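
Before a run it can be useful to check which of the selected modules will fail for lack of a key. The sketch below assumes the keys have already been loaded into a plain dictionary that maps the module names listed above to key strings; that dictionary, the `missing_keys` function and the `KEYED_MODULES` constant are hypothetical and are not part of theHarvester.

```python
from typing import Dict, Iterable, List, Optional

# Module names taken from the list above.
KEYED_MODULES = frozenset(
    {"bing", "github", "hunter", "intelx", "pentesttools", "securityTrails", "shodan", "spyse"}
)


def missing_keys(selected: Iterable[str], configured: Dict[str, Optional[str]]) -> List[str]:
    """Return the selected modules that need an API key but have none configured."""
    return sorted(
        module
        for module in set(selected)
        if module in KEYED_MODULES and not configured.get(module)
    )


if __name__ == "__main__":
    # Hypothetical configuration: hunter has a key, shodan is present but empty.
    keys = {"hunter": "abc123", "shodan": ""}
    print(missing_keys(["shodan", "hunter", "google"], keys))
```

Modules that are not in the list, such as `google` here, are ignored because they do not need a key.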

Install and dependencies:

Comments, bugs and requests:

Main contributors:

  • Matthew Brown (Twitter: @NotoriousRebel1)
  • Jay "L1ghtn1ng" Townsend (Twitter: @jay_townsend1)
  • Lee Baird (Twitter: @discoverscripts)
  • Janos Zold (LinkedIn)

Thanks:

  • John Matherly - Shodan project
  • Ahmed Aboul Ela - subdomain names dictionaries (big and small)