A command-line interface (CLI) based passive URL discovery utility. It is designed to efficiently identify known URLs of given domains by tapping into a multitude of curated online passive sources.

hqurlfind3r

A passive reconnaissance tool for known URL discovery: it gathers a list of URLs passively using various online sources.

Resources

  • Features
  • Installation
    • From Binary
    • From Source
    • From GitHub
  • Post Installation
  • Usage
    • Examples
      • Basic
      • Regex filter URLs
      • Include Subdomains' URLs
  • Contribution

Features

  • Collect known URLs:
  • Reduce noise:
    • Regex filter URLs.
    • Removes duplicate pages, that is, URLs whose patterns are probably repetitive and point to the same web template.
  • Output to stdout for piping or save to file.
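
The pattern-based de-duplication described under "Reduce noise" can be pictured with a small shell sketch. This is not the tool's actual implementation: dedupe_url_patterns is a hypothetical helper, and treating purely numeric path segments as interchangeable is an assumption made only for illustration.

```shell
# Hypothetical helper, NOT part of hqurlfind3r.
# Reads URLs on stdin and keeps the first URL seen for each "pattern",
# where a pattern is the URL with every purely numeric path segment
# replaced by the placeholder {n}.
dedupe_url_patterns() {
  awk '{
    n = split($0, seg, "/")
    key = ""
    for (i = 1; i <= n; i++) {
      s = seg[i]
      if (s ~ /^[0-9]+$/) s = "{n}"      # /posts/123 -> /posts/{n}
      key = key (i > 1 ? "/" : "") s
    }
    if (!(key in seen)) { seen[key] = 1; print }
  }'
}
```

Because the tool writes to stdout, a helper like this could sit at the end of a pipe, for example: hqurlfind3r -d tesla.com -s | dedupe_url_patterns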

Installation

From Binary

You can download the pre-built binary for your platform from this repository's releases page, extract it, then move it to a directory in your $PATH and you're ready to go.

From Source

hqurlfind3r requires go1.17+ to install successfully. Run the following command to install it:

go install -v github.com/hueristiq/hqurlfind3r/cmd/hqurlfind3r@latest

From GitHub

git clone https://github.com/hueristiq/hqurlfind3r.git && \
cd hqurlfind3r/cmd/hqurlfind3r/ && \
go build . && \
mv hqurlfind3r /usr/local/bin/ && \
hqurlfind3r -h

Post Installation

hqurlfind3r will work after installation. However, to configure hqurlfind3r to work with certain services (currently GitHub), you will need to set up API keys. The API keys are stored in the $HOME/.config/hqurlfind3r/conf.yaml file, which is created upon first run and uses the YAML format. Multiple API keys can be specified for each of these services.

Example:

version: 1.9.0
sources:
    - commoncrawl
    - github
    - intelx
    - otx
    - urlscan
    - wayback
    - waybackrobots
keys:
    github:
        - d23a554bbc1aabb208c9acfbd2dd41ce7fc9db39
        - asdsd54bbc1aabb208c9acfbd2dd41ce7fc9db39
    intelx:
        - 2.intelx.io:00000000-0000-0000-0000-000000000000

Usage

DISCLAIMER: fetching URLs from GitHub is a bit slow.

hqurlfind3r -h

This will display help for the tool.

 _                      _  __ _           _ _____      
| |__   __ _ _   _ _ __| |/ _(_)_ __   __| |___ / _ __ 
| '_ \ / _` | | | | '__| | |_| | '_ \ / _` | |_ \| '__|
| | | | (_| | |_| | |  | |  _| | | | | (_| |___) | |   
|_| |_|\__, |\__,_|_|  |_|_| |_|_| |_|\__,_|____/|_| v1.9.0
          |_|
USAGE:
  hqurlfind3r [OPTIONS]

OPTIONS:
   -d, --domain            domain to fetch urls for
  -eS, --exclude-sources   comma(,) separated list of sources to exclude
   -f, --filter            URL filtering regex
  -iS, --include-subs      include subdomains' urls
  -lS, --list-sources      list all the available sources
  -nC, --no-color          no color mode
   -s  --silent            silent mode: output urls only
  -uS, --use-sources       comma(,) separated list of sources to use
   -o, --output            output file

Examples

Basic

hqurlfind3r -d tesla.com

Regex filter URLs

hqurlfind3r -d tesla.com -f "\.(jpg|jpeg|gif|png|ico|css|eot|tif|tiff|ttf|woff|woff2)"
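
When writing a filter like this, note that an unescaped dot in a regular expression matches any character, so an extension filter can catch more URLs than intended. The sketch below illustrates the difference with grep -E on made-up sample URLs. It assumes the pattern behaves the same way in the tool's regex engine, which holds for simple patterns like these; sample_urls and match_urls are hypothetical helpers.

```shell
# Sample URLs standing in for tool output (hypothetical data).
sample_urls() {
  printf '%s\n' \
    'https://example.com/logo.png' \
    'https://example.com/icons/list' \
    'https://example.com/about'
}

# Prints the sample URLs matched by the extended regex given as $1.
match_urls() {
  sample_urls | grep -E -- "$1" || true
}

match_urls '.(ico|png)'    # unescaped dot: logo.png, plus /icons/list via "/ico"
match_urls '\.(ico|png)'   # escaped dot: only logo.png
```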

Include Subdomains' URLs

hqurlfind3r -d tesla.com -iS
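
The options listed in the help output can also be combined. As a rough sketch, the hypothetical wrapper below (unique_urls is not part of the tool) restricts the run to chosen sources with -uS, uses -s so that only URLs are printed, and pipes the result through sort -u. The source names are assumed to be ones reported by -lS.

```shell
# Hypothetical wrapper around documented flags only.
# $1 = domain, $2 = comma-separated list of sources (see: hqurlfind3r -lS)
unique_urls() {
  hqurlfind3r -d "$1" -uS "$2" -s | sort -u
}
```

For example, unique_urls tesla.com wayback,commoncrawl > urls.txt would save the sorted, unique URLs to a file.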

Contribution

Issues and Pull Requests are welcome!