The Osint Toolkit
The Osint Toolkit
The Osint Toolkit
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Baidu Maps
http://map.baidu.com/
A Google Maps alternative from the Chinese search company Baidu. Check
out the Baidu view of the world in relation to border disputes.
Bing Maps
https://www.bing.com/maps/
Bing Maps are an alternative to the highly popular Google Maps. Bing maps
contain details of when its satalite image data was last updated allowing
you to judge how accurate the information is.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Bug Bounty Toolkit
https://medium.com/bugbountyhunting/bug-bounty-toolkit-aa36f4365f3f
Camera Trace
http://www.cameratrace.com/
Find additional images taken by the same physical camera using the serial
number stored in image EXIF metadata.
Common Crawl
http://commoncrawl.org/ “We build and maintain an open repository of
web crawl data that can be accessed and analyzed by anyone. The Common
Crawl corpus contains petabytes of data collected over the last 7 years. It
contains raw web page data, extracted metadata and text extractions.”
Use the Common Crawl API to search their indexed crawl data for sites of
interest, then pull the full dataset from their Amazon S3 repository for
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
further analysis. Common Crawl contains snapshots of websites taken over
the past 7 years with metadata about the website and services providing it.
Corona
http://corona.cast.uark.edu/
“CORONA is the codename for the United States’ rst photographic spy
satellite mission, in operation from 1960–1972. During that time, CORONA
satellites took high-resolution images of most of the earth’s surface, with
particular emphasis on Soviet bloc countries and other political hotspots in
order to monitor military sites and produce maps for the Department of
Defense. The more than 800,000 images collected by the CORONA missions
remained classi ed until 1995 when an executive order by President Bill
Clinton made them publicly available through the US Geological Survey.
Because CORONA images preserve a high-resolution picture of the world as
it existed in the 1960s, they constitute a unique resource for researchers
and scientists studying environmental change, agriculture, geomorphology,
archaeology and other elds.”
Cree.py
http://www.geocreepy.com “A Geolocation OSINT Tool. O ers geolocation
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
information gathering through social networking platforms.”
Use Cree.py to nd the dates, times, and locations of social media posts for
a given person from platforms such as Twitter, Instagram, and Flicker. This
will allow you to track the movements of named individuals based on their
geo-tagged social media posts. Frequent posts from speci c locations
indicate their own and family/friend’s residences.
Datasploit
https://github.com/upgoingstar/datasploit
Daum Maps
http://map.daum.net/
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
A Korean alternative to Google Maps
elasticArchive
https://github.com/craighays/elasticArchive
Log and perform full-text searches on all of your web tra c with Mitmproxy
and ElasticArchive
Folium
https://github.com/python-visualization/folium “Folium builds on the
data wrangling strengths of the Python ecosystem and the mapping
strengths of the Lea et.js library. Manipulate your data in Python, then
visualize it in on a Lea et map via Folium.”
Use Folium to collate and display location linked data on Lea et maps
through the Python programming language.
FotoForensics
http://fotoforensics.com/
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
An online EXIF metadata viewer. In addition to the regular metadata it
provices an Error Level Analysis (ELA) allowing you to see the compression
rate of the image changes highlighting any resaves, crops, stretches,
additions, and other modi cations.
Geopy
https://github.com/geopy/geopy “A Python 2 and 3 client for several
popular geocoding web services. Geopy makes it easy for Python developers
to locate the coordinates of addresses, cities, countries, and landmarks
across the globe using third-party geocoders and other data sources.
Gephy
https://gephi.org/ “Gephi is the leading visualization and exploration
software for all kinds of graphs and networks.”
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Use Gephy to visualise networks and relationships between people,
websites, objects, companies, locations, or anything else that is linked. Find
rst, second, third, etc. degrees of separation between targets.
Gitrob
https://github.com/michenriksen/gitrob
“Gitrob is a command line tool which can help organizations and security
professionals nd sensitive information lingering in publicly available les
on GitHub. The tool will iterate over all public organization and member
repositories and match lenames against a range of patterns for les that
typically contain sensitive or dangerous information.”
Google Chrome
https://www.google.co.uk/chrome/browser/desktop/
Use Google Chrome’s developer tools to inspect the source code of websites,
view network activity, a very powerful Javascript console, and a variety of
plugins to expand it’s capability. Use with Hunch.ly and Website IP to cache
all content locally along with the IP address of the server displaying the
content.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Google Earth
https://earth.google.com/ “lets you y anywhere on Earth to view satellite
imagery, maps, terrain, 3D buildings, from galaxies in outer space to the
canyons of the ocean.”
Use Google Earth for location reconnaissance. Import your own KML data
to plot information against physical locations and view geo-linked data
relevant to the location
Online map tools and 360 images taken from millions of locations around
the globe. Use these to gather information about a location without needing
to physically be there. Combine with Panoramio, Instagram, Flickr, and
Twitter data to build a visual picture of a location from multiple sources and
individuals. Useful for nding the locations of images and videos without
geo-tag data embedded.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Google URL Shortener
https://goo.gl
Not much use for OSINT alone, but add .info to the end of any Google
shortened url to view analytics gathered on the link such as number of
clicks, demographics, and locations of the visitors.
HERE
https://maps.here.com/
Hunch.ly
https://www.hunch.ly/ “Inspector Hunchly toils in the background of your
web browser to track, analyze and store web pages while you perform
online investigations. Forgets nothing, keeps everything. Inspector Hunchly
Hates SPAM. You will never get any from us. Ever.”
Use Hunch.ly to download, index, and search anything you’ve ever looked
at online. Ever. All content works locally without relying on internet
resources so even if it gets deleted you can still see it.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Imagga
http://imagga.com/ “Imagga is an Image Recognition Platform-as-a-
Service providing Image Tagging APIs for developers & businesses to build
scalable, image intensive cloud apps.”
View cached versions of websites over time. Useful for nding information
that has since been updated or deleted.
IPinfo.io
https://ipinfo.io/AS36040
Lea et
http://lea etjs.com/ “Lea et is the leading open-source JavaScript library
for mobile-friendly interactive maps.”
Use Lea et to create interactive web based GIS visualisations using data
mined through other tools. Use with Folium.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Maltego
https://www.paterva.com/
Meanpath
https://meanpath.com/ “A search engine that captures the various bits of
code, CSS and HTML across hundreds of millions of websites. This enables
you to search for bits of code that might be not be indexed by other search
providers.”
Navar Maps
http://map.naver.com/
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
A Korean alternative to Google Maps.
NerdyData
https://nerdydata.com/search
Use NerdyData to search for other sites that have included the same source
code in their pages. Useful for nding other sites with the same Google
Analytics, Adsense, and A liate accounts linked to them.
NetworkX
https://networkx.github.io/ “NetworkX is a Python language software
package for the creation, manipulation, and study of the structure,
dynamics, and functions of complex networks.”
OnionScan
https://github.com/s-rah/onionscan “The purpose of this tool is to make
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
you a better onion service provider. You owe it to yourself and your users to
ensure that attackers cannot easily exploit and deanonymize.”
OpenCorporates
https://opencorporates.com/ “The largest open database of companies in
the world”
OpenStreetMap
http://www.openstreetmap.org/ “OpenStreetMap is a map of the world,
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
created by people like you and free to use under an open license.”
Panoramio
http://www.panoramio.com/api/data/api.html “Using Panoramio API you
can display the photos from Panoramio on your own web site. Geolocated
photos from Panoramio are great to enrich your maps or illustrate
information where location is a important factor”
Pillow
https://pypi.python.org/pypi/Pillow/ “The Python Imaging Library adds
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
image processing capabilities to your Python interpreter. This library
provides extensive le format support, an e cient internal representation,
and fairly powerful image processing capabilities.”
Use Pillow to manage, manipulate, and process images in bulk using the
Python programming language. Combine with other image APIs such as
TinEye and Imagga to quickly tag and search for large numbers of image
documents.
pyPDF
https://pypi.python.org/pypi/pyPdf/ “A Pure-Python library built as a PDF
toolkit”.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Use pyPDF for interacting, PDF les. Capable of a variety of operations
including opening, scraping, editing, splitting, merging, encrypting and
decrypting documents, all through the Python programming language.
Python does not come with an HTTP library by default. Install the Requests
library for all interactions with websites and online services that don’t
already have their own pre-made python library.
SameID
http://sameid.net/
Use SameID to nd websites using the same Google Analytics account based
on the UA-1234567-XX tag. Also compares AdSense, Amazon, Clickbank,
Addthis services.
SASGIS
http://sasgis.ru/sasplaneta/
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
SASGIS is a russian made tool for “viewing and downloading high-
resolution satellite imagery and conventional maps submitted by such
services as the Google Earth , the Google the Maps , the Bing the Maps ,
DigitalGlobe’s , “ Kosmosnimki “, Yandex , Yahoo! The Maps , VirtualEarth ,
Gurtam , by OpenStreetMap , eAtlas , the iPhone maps, maps of the General
Sta , and others.”
Shodan
https://www.shodan.io/ “Shodan is the world’s rst search engine for
Internet-connected devices.”
Rather than searching for website content, Shodan allows you to search for
the back-end services providing the content. A public API is available
allowing you to search straight from your programming language of choice.
SpyOnWeb
http://www.spyonweb.com/
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Stolen Camera Finder
http://www.stolencamera nder.com/
Find additional images taken by the same physical camera using the serial
number stored in image EXIF metadata.
Sublist3r
https://github.com/aboul3la/Sublist3r
Tesseract
https://github.com/tesseract-ocr
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Text Mechanic Number Generator
http://textmechanic.com/text-tools/numeration-tools/generate-list-
numbers/
TimelineJS
http://timeline.knightlab.com/ “Easy-to-make, beautiful timelines.”
Use TimelineJS for creating and and publishing web viewable timelines of
events. An e ective tool for storytelling and delivering evidence following
an investigation.
TinEye
https://www.tineye.com/ “Reverse Image Search. 15.7 billion images
indexed and growing.”
Use TinEye to nd out where an image came from, how it is being used, if
modi ed versions of the image exist, or to nd higher resolution versions. It
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
will also nd visually similar images which can provide further evidence
that your version has been doctored. The TinEye API allows you to
programatically perform image searches saving you plenty of time and
manual labour.
Topia.termextract
https://pypi.python.org/pypi/topia.termextract/ “Determines important
terms within a given piece of content. It uses linguistic tools such as Parts-
Of-Speech (POS) and some simple statistical analysis to determine the
terms and their strength.”
TwitterAPI
https://apps.twitter.com/
Use the Twitter API to create your own private apps to search and scrape
Tweets by location, hashtag, search strings, and people. Combine with
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Download follow and follower lists as well as public lists (custom timelines)
on a per-user basis. Lea et and Folium to map to physical locations or with
Gephy to identify relationships between individuals.
WebsiteIP
https://chrome.google.com/webstore/detail/website-
ip/ghbmhlgniedlklkpimlibbaoomlpacmk “Simply adds the IP of the website
you are viewing to the bottom right.”
Use the WebsiteIP Chrome extension to quickly view the IP address of the
website you’re currently on. More useful that you’d have thought before you
installed it, and as it is appended to the body of the HTML document it gets
captured in Hunch.ly for later analysis.
Who.is
https://who.is/ “WHOIS Search, Domain Name, Website, and IP Tools”
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Useful for identifying the current owner of websites and the servers and
ISPs hosting services under a given domain name.
Whois.domaintools.com
http://whois.domaintools.com/
Whoisology
https://whoisology.com/ “Whoisology is a searchable domain name
reverse whois / ownership database with over one billion individual
domain name records that are updated pretty regularly. Reverse whois is
used for cyber crime investigation / InfoSec, corporate intelligence, legal
research, business development, and for good ol’ fashioned poking around.”
Another tool for nding current and previous owners of domain names and
other domains that they have registered.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Wikimapia
http://wikimapia.org/ “A privately owned open-content collaborative
mapping project, that utilizes an interactive “clickable” web map with a
geographically-referenced wiki system, with the aim to mark and describe
all geographical objects in the world.”
Yandex Maps
https://yandex.com/maps/
A free tool for Windows, Linux and OS X for creating network, relationship
and other diagrams.
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Osint Open Source Intelligence Tools Security
144 claps
WRITTEN BY
Craig Hays Follow
osint Follow
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
More From Medium
I gave my Tinder data to a professional data analyst Karl Pearson’s correlation(Pearson’s r)and Spearman’s
correlation using Python
Molly India Nye in The Startup
Fahad vp
Hashing Technique and its importance. What Coronavirus Has to Do With Earth Day
Bharath Boggarapu in The Startup Ripley Cleghorn in Nightingale
Machine Learning for Social Good: Charity Donations Evaluation Metrics for Regression Problems
Brett & Butter Edward Ma in Towards AI — Multidisciplinary Science
Journal
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
About Help Legal
Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD