E-mails, subdomains and names Harvester - OSINT
Find a file
2019-09-01 02:08:58 +01:00
.github/workflows Update theHarvester.yml 2019-09-01 01:51:48 +01:00
tests Updated some files to conform to pep8 standards fixing flake8 issues. 2019-08-21 13:42:42 -04:00
theHarvester Travis fixes/flake8 fixes 2019-09-01 01:27:13 +01:00
wordlists Syncing and updated crtsh to work properly. 2019-08-08 00:27:42 -04:00
.gitattributes Removed google-profiles and clean up. 2019-02-13 23:05:52 -06:00
.gitignore Removed duplicate entry. 2019-03-23 17:31:20 -05:00
.travis.yml Travis fixes, flake8 fixes 2019-09-01 01:12:16 +01:00
api-keys.yaml Syncing and updated crtsh to work properly. 2019-08-08 00:27:42 -04:00
changelog.txt Updated README and removed stale code. 2019-02-27 21:31:45 -06:00
COPYING Fix line endings. 2019-02-04 13:48:29 +01:00
Dockerfile Dockerfile fix 2019-08-12 18:17:00 +02:00
LICENSES Fix line endings. 2019-02-04 13:48:29 +01:00
README.md Fix travis and remove all as a source from the readme 2019-09-01 00:39:17 +01:00
requirements.txt Bump pytest from 5.1.1 to 5.1.2 2019-08-30 19:24:13 +00:00
setup.cfg Travis fixes/flake8 fixes 2019-09-01 01:27:13 +01:00
setup.py Travis fixes/flake8 fixes 2019-09-01 01:27:13 +01:00
theHarvester.py Updated some files to conform to pep8 standards fixing flake8 issues. 2019-08-21 13:42:42 -04:00

*******************************************************************
*                                                                 *
* | |_| |__   ___    /\  /\__ _ _ ____   _____  ___| |_ ___ _ __  *
* | __| '_ \ / _ \  / /_/ / _` | '__\ \ / / _ \/ __| __/ _ \ '__| *
* | |_| | | |  __/ / __  / (_| | |   \ V /  __/\__ \ ||  __/ |    *
*  \__|_| |_|\___| \/ /_/ \__,_|_|    \_/ \___||___/\__\___|_|    *
*                                                                 *
* theHarvester 3.1.0 dev                                          *
* Coded by Christian Martorella                                   *
* Edge-Security Research                                          *
* cmartorella@edge-security.com                                   *
*******************************************************************

Build Status Language grade: Python

What is this?

theHarvester is a very simple, yet effective tool designed to be used in the early
stages of a penetration test. Use it for open source intelligence gathering and
helping to determine a company's external threat landscape on the internet. The
tool gathers emails, names, subdomains, IPs, and URLs using multiple public data
sources that include:

Passive:

  • baidu: Baidu search engine - www.baidu.com

  • bing: Microsoft search engine - www.bing.com

  • bingapi: Microsoft search engine, through the API (Requires API key, see below.)

  • censys: Censys.io search engine - www.censys.io

  • crtsh: Comodo Certificate search - www.crt.sh

  • dnsdumpster: DNSdumpster search engine - dnsdumpster.com

  • dogpile: Dogpile search engine - www.dogpile.com

  • duckduckgo: DuckDuckGo search engine - www.duckduckgo.com

  • github-code: Github code search engine (Requires Github Personal Access Token, see below.) - www.github.com

  • google: Google search engine (Optional Google dorking.) - www.google.com

  • hunter: Hunter search engine (Requires API key, see below.) - www.hunter.io

  • intelx: Intelx search engine (Requires API key, see below.) - www.intelx.io

  • linkedin: Google search engine, specific search for Linkedin users - www.linkedin.com

  • netcraft: Netcraft Data Mining - www.netcraft.com

  • securityTrails: Security Trails search engine, the world's largest repository
    of historical DNS data (Requires API key, see below.) - www.securitytrails.com

  • shodan: Shodan search engine, will search for ports and banners from discovered
    hosts - www.shodanhq.com

  • threatcrowd: Open source threat intelligence - www.threatcrowd.org

  • trello: Search trello boards (Uses Google search.)

  • twitter: Twitter accounts related to a specific domain (Uses Google search.)

  • vhost: Bing virtual hosts search

  • virustotal: virustotal.com domain search

  • yahoo: Yahoo search engine

Active:

  • DNS brute force: dictionary brute force enumeration
  • DNS reverse lookup: reverse lookup of IP´s discovered in order to find hostnames
  • DNS TDL expansion: TLD dictionary brute force enumeration

Modules that require an API key:

Add your keys to api-keys.yaml

  • bingapi
  • github
  • hunter
  • intelx
  • securityTrails
  • shodan

Dependencies:

  • Python 3.6+
  • python3 -m pip install -r requirements.txt
  • Recommend that you use a virtualenv when cloning from git

Comments, bugs, or requests?

Main contributors:

  • Twitter Follow Matthew Brown @NotoriousRebel1
  • Twitter Follow Jay "L1ghtn1ng" Townsend @jay_townsend1
  • LinkedIn Janos Zold
  • Twitter Follow Lee Baird @discoverscripts

Thanks:

  • John Matherly - Shodan project
  • Ahmed Aboul Ela - subdomain names dictionaries (big and small)