keyhunter

Author	SHA1	Message	Date
salvacybersec	35fa4ad174	feat(16-01): add URLhaus recon source - URLhausSource searches abuse.ch URLhaus API for malicious URLs with API keys - Credentialless source (Enabled always true, no API key needed) - Tag lookup with payload endpoint fallback - ciLogKeyPattern used for content matching - Tests with httptest mocks for happy path and empty results Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 16:45:23 +03:00
salvacybersec	e02bad69ba	feat(16-01): add VirusTotal and IntelligenceX recon sources - VirusTotalSource searches VT Intelligence API for files containing API keys - IntelligenceXSource searches IX archive with 3-step flow (search/results/read) - Both credential-gated (Enabled returns false without API key) - ciLogKeyPattern used for content matching - Tests with httptest mocks for happy path and empty results Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 16:44:41 +03:00
salvacybersec	83a1e83ae5	fix(phase-15): update register tests for 67 total sources (Phase 10-15)	2026-04-06 16:37:48 +03:00
salvacybersec	d02cdcc7e0	feat(15-03): add Grafana and Sentry ReconSource implementations - GrafanaSource: search dashboards via /api/search, fetch detail via /api/dashboards/uid - SentrySource: search issues via /api/0/issues, fetch events for key detection - Register all 5 log aggregator sources in RegisterAll (67 sources total) - Tests use httptest mocks for each API endpoint	2026-04-06 16:31:14 +03:00
salvacybersec	bc63ca1f2f	feat(15-03): add Elasticsearch, Kibana, and Splunk ReconSource implementations - ElasticsearchSource: POST _search API with query_string, parse hits._source - KibanaSource: GET saved_objects/_find API with kbn-xsrf header - SplunkSource: GET search/jobs/export API with newline-delimited JSON parsing - All sources use ciLogKeyPattern for key detection - Tests use httptest mocks for each API endpoint	2026-04-06 16:31:05 +03:00
salvacybersec	fcc1a769c5	feat(15-01): add Discord, Slack, DevTo recon sources and wire all six - DiscordSource uses dorking approach against configurable search endpoint - SlackSource uses dorking against slack-archive indexers - DevToSource searches dev.to API articles list + detail for body_markdown - RegisterAll extended to include all 6 Phase 15 forum sources - All credentialless, use ciLogKeyPattern for key detection	2026-04-06 16:29:52 +03:00
salvacybersec	282c145a43	feat(15-01): add StackOverflow, Reddit, HackerNews recon sources - StackOverflowSource searches SE API v2.3 search/excerpts endpoint - RedditSource searches Reddit JSON API with custom User-Agent - HackerNewsSource searches Algolia HN API for comments - All credentialless, use ciLogKeyPattern for key detection - Tests use httptest mock servers with API key patterns	2026-04-06 16:28:23 +03:00
salvacybersec	37393a9b5f	feat(15-02): wire Trello, Notion, Confluence, GoogleDocs into RegisterAll - RegisterAll extended to 56 sources (52 Phase 10-14 + 4 Phase 15 collab) - All four sources credentialless, no new SourcesConfig fields needed	2026-04-06 13:50:56 +03:00
salvacybersec	5d568333c7	feat(15-02): add Confluence and GoogleDocs ReconSource implementations - ConfluenceSource searches exposed instances via /rest/api/content/search CQL - GoogleDocsSource uses dorking + /export?format=txt for plain-text scanning - HTML tag stripping for Confluence storage format - Both credentialless, tests with httptest mocks confirm findings	2026-04-06 13:50:14 +03:00
salvacybersec	7bb614678d	feat(15-02): add Trello and Notion ReconSource implementations - TrelloSource searches public Trello boards via /1/search API - NotionSource uses dorking to discover and scrape public Notion pages - Both credentialless, follow established Phase 10 pattern - Tests with httptest mocks confirm Sweep emits findings	2026-04-06 13:50:04 +03:00
salvacybersec	4246db8294	fix: resolve Phase 14 merge conflicts across CI/CD, archive, and frontend sources	2026-04-06 13:42:54 +03:00
salvacybersec	27624e0ec7	merge: phase 14-04 register wiring	2026-04-06 13:39:32 +03:00
salvacybersec	7ef6c2ac34	feat(14-04): wire all 12 Phase 14 sources in RegisterAll (45 -> 52 total) - Add CircleCIToken to SourcesConfig with env/viper lookup in cmd/recon.go - Register 7 new sources: travisci, ghactions, circleci, jenkins, wayback, commoncrawl, jsbundle - Update register_test.go expectations from 45 to 52 sources - Add integration test handlers + registrations for all 12 Phase 14 sources - Integration test now validates 52 sources end-to-end	2026-04-06 13:34:18 +03:00
salvacybersec	169b80b3bc	feat(14-04): implement 7 Phase 14 sources (CI/CD, archives, JS bundles) - TravisCISource: scrapes public Travis CI build logs for API key leaks - GitHubActionsSource: searches Actions workflow logs (requires GitHub token) - CircleCISource: scrapes CircleCI pipeline logs (requires CircleCI token) - JenkinsSource: scrapes public Jenkins console output for leaked secrets - WaybackMachineSource: searches Wayback Machine CDX for archived key leaks - CommonCrawlSource: searches Common Crawl index for exposed pages - JSBundleSource: probes JS bundles for embedded API key literals	2026-04-06 13:34:09 +03:00
salvacybersec	3a4e9c11bf	fix: add CircleCIToken to SourcesConfig	2026-04-06 13:22:25 +03:00
salvacybersec	095b90ec07	merge: phase 14-03 frontend leaks	2026-04-06 13:21:39 +03:00
salvacybersec	aeebf37174	merge: phase 14 wave 1 all conflicts resolved	2026-04-06 13:21:32 +03:00
salvacybersec	0a8be81f0c	feat(14-03): wire 5 frontend leak sources in RegisterAll (40 -> 45 sources) - Register SourceMapSource, WebpackSource, EnvLeakSource, SwaggerSource, DeployPreviewSource - Update test expectations from 40 to 45 sources	2026-04-06 13:19:34 +03:00
salvacybersec	7d8a4182d7	feat(14-03): implement SwaggerSource and DeployPreviewSource with tests - SwaggerSource probes OpenAPI doc endpoints for API keys in example/default fields - DeployPreviewSource scans Vercel/Netlify preview URLs for __NEXT_DATA__ env leaks - Both implement ReconSource, credentialless, with httptest-based tests	2026-04-06 13:18:18 +03:00
salvacybersec	e0f267f7bf	feat(14-01): add 5 CI/CD log sources (GitHubActions, TravisCI, CircleCI, Jenkins, GitLabCI) - GitHubActionsSource: searches GitHub code search for workflow files with provider keywords (token-gated) - TravisCISource: queries Travis CI v3 API for public build logs (credentialless) - CircleCISource: queries CircleCI v2 pipeline API for build pipelines (token-gated) - JenkinsSource: queries open Jenkins /api/json for job build consoles (credentialless) - GitLabCISource: queries GitLab projects API for CI-enabled projects (token-gated) - RegisterAll extended to 45 sources (40 Phase 10-13 + 5 Phase 14) - Integration test updated with fixtures for all 5 new sources - cmd/recon.go wires CIRCLECI_TOKEN env var	2026-04-06 13:17:31 +03:00
salvacybersec	b57bd5e7d9	feat(14-03): implement SourceMapSource, WebpackSource, EnvLeakSource with tests - SourceMapSource probes .map files for original source containing API keys - WebpackSource scans JS bundles for inlined NEXT_PUBLIC_/REACT_APP_/VITE_ env vars - EnvLeakSource probes common .env paths for exposed environment files - All three implement ReconSource, credentialless, with httptest-based tests	2026-04-06 13:17:07 +03:00
salvacybersec	c5332454b0	feat(14-02): add WaybackMachine + CommonCrawl recon sources - WaybackMachineSource queries CDX API for historical snapshots - CommonCrawlSource queries CC Index API for matching pages - Both credentialless, rate-limited at 1 req/5s, RespectsRobots=true - RegisterAll extended to 42 sources (40 Phase 10-13 + 2 Phase 14) - Full httptest-based test coverage for both sources	2026-04-06 13:16:13 +03:00
salvacybersec	9b005e78bb	test(13-04): add integration test handlers for all 12 Phase 13 sources (40 total) - Add httptest mux handlers for npm, pypi, crates, rubygems, maven, nuget, goproxy, packagist, dockerhub, k8s, terraform, helm - Register all 12 Phase 13 sources with BaseURL prefix routing - Update expected source types and count assertions from 28 to 40	2026-04-06 13:03:27 +03:00
salvacybersec	c16f5feaee	feat(13-04): wire all 12 Phase 13 sources into RegisterAll (40 total) - Add 8 package registry sources (npm, pypi, crates, rubygems, maven, nuget, goproxy, packagist) - Update register_test to assert 40 sources in sorted list - Update Phase 12 integration test count from 32 to 40	2026-04-06 12:59:11 +03:00
salvacybersec	a607082131	merge: phase 13 resolve conflicts	2026-04-06 12:57:29 +03:00
salvacybersec	7e0e401266	feat(13-03): wire 4 Phase 13 sources into RegisterAll (32 total) - Register DockerHub, Kubernetes, Terraform, Helm as credentialless sources - Update RegisterAll tests and integration test to expect 32 sources	2026-04-06 12:55:52 +03:00
salvacybersec	0727b51d79	feat(13-03): implement TerraformSource and HelmSource - Terraform searches registry.terraform.io v1 modules API with namespace/name/provider URLs - Helm searches artifacthub.io for charts (kind=0) with repo/chart URL construction - Both sources: context cancellation, nil registry, httptest-based tests	2026-04-06 12:53:58 +03:00
salvacybersec	9907e2497a	feat(13-01): implement CratesIOSource and RubyGemsSource with httptest tests - CratesIOSource searches crates.io JSON API with custom User-Agent header - RubyGemsSource searches rubygems.org search.json API for gem matches - Both credentialless; CratesIO 1 req/s burst 1, RubyGems 1 req/2s burst 2 - Tests verify User-Agent header, Sweep findings, ctx cancellation, metadata	2026-04-06 12:53:41 +03:00
salvacybersec	018bb165fe	feat(13-02): implement GoProxySource and PackagistSource with tests - GoProxySource parses pkg.go.dev HTML search results for module paths - PackagistSource queries Packagist JSON search API for PHP packages - GoProxy regex requires domain dot to filter non-module paths	2026-04-06 12:53:37 +03:00
salvacybersec	3a8123edc6	feat(13-03): implement DockerHubSource and KubernetesSource - DockerHub searches hub.docker.com v2 search API for repos matching provider keywords - Kubernetes searches Artifact Hub for operators/manifests with kind-aware URL paths - Both sources: context cancellation, nil registry, httptest-based tests	2026-04-06 12:52:45 +03:00
salvacybersec	4b268d109f	feat(13-01): implement NpmSource and PyPISource with httptest tests - NpmSource searches npm registry JSON API for provider keywords - PyPISource scrapes pypi.org search HTML for project links - Both credentialless, rate-limited at 1 req/2s, burst 2 - httptest-based tests verify Sweep, ctx cancellation, Name/Rate/Burst	2026-04-06 12:52:31 +03:00
salvacybersec	23613150f6	feat(13-02): implement MavenSource and NuGetSource with tests - MavenSource queries Maven Central Solr API for provider keyword matches - NuGetSource queries NuGet gallery search API with projectUrl fallback - Both sources: httptest fixtures, ctx cancellation, metadata tests	2026-04-06 12:52:27 +03:00
salvacybersec	f0f22191ef	test(12-04): add end-to-end SweepAll integration test across all 28 sources - Extend integration test with Phase 12 IoT scanner fixtures (shodan, censys, zoomeye, fofa, netlas, binaryedge) - Add cloud storage fixtures with correct formats (S3 XML, GCS JSON, Azure EnumerationResults XML) - Add TestRegisterAll_Phase12 verifying 28 sources, enabled/disabled states - Add TestRegisterAll_Phase12_SweepAllNoPanic for timeout resilience - Update existing register_test.go from 18 to 28 expected sources	2026-04-06 12:41:59 +03:00
salvacybersec	870431658d	feat(12-04): wire all 10 Phase 12 sources into RegisterAll + cmd/recon.go credentials - Add 8 Phase 12 credential fields to SourcesConfig (Shodan, Censys, ZoomEye, FOFA, Netlas, BinaryEdge) - Register all 10 Phase 12 sources in RegisterAll (6 IoT + 4 cloud storage) - Wire env/viper credential lookup in cmd/recon.go buildReconEngine - Update reconCmd Long description to mention Phase 12 sources	2026-04-06 12:31:57 +03:00
salvacybersec	ade609d562	merge: phase 12 resolve conflicts	2026-04-06 12:27:23 +03:00
salvacybersec	c54e9c73ca	merge: phase 12 resolve conflicts	2026-04-06 12:27:23 +03:00
salvacybersec	13905eb5ee	feat(12-03): implement AzureBlobScanner, DOSpacesScanner, and all cloud scanner tests - AzureBlobScanner enumerates public Azure Blob containers with XML listing - DOSpacesScanner enumerates public DO Spaces across 5 regions (S3-compatible XML) - httptest-based tests for all four scanners: sweep, empty registry, ctx cancel, metadata - All sources credentialless, compile-time interface assertions	2026-04-06 12:26:01 +03:00
salvacybersec	47d542b9de	feat(12-03): implement S3Scanner and GCSScanner cloud storage recon sources - S3Scanner enumerates public AWS S3 buckets by provider keyword + suffix pattern - GCSScanner enumerates public GCS buckets with JSON listing format - Shared bucketNames() helper and isConfigFile() filter for config-pattern files - Both credentialless (anonymous HTTP), always Enabled, BaseURL override for tests	2026-04-06 12:25:55 +03:00
salvacybersec	6443e63b9a	test(12-01): add httptest tests for Shodan, Censys, ZoomEye sources - Each source tested with mock API server returning 2 results - Verifies Enabled() disabled when credentials empty - Verifies Sweep emits correct findings with proper SourceType - Verifies context cancellation propagation	2026-04-06 12:24:18 +03:00
salvacybersec	d6c35f4f14	test(12-02): add httptest tests for FOFA, Netlas, BinaryEdge sources - FOFA: mock JSON with 2 results, credential validation, context cancellation - Netlas: mock JSON with 2 items, X-API-Key header check, context cancellation - BinaryEdge: mock JSON with 2 events, X-Key header check, context cancellation - All verify correct finding count, source type, and disabled state Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 12:24:11 +03:00
salvacybersec	270bbbfb49	feat(12-02): implement FOFA, Netlas, BinaryEdge recon sources - FOFASource searches FOFA API with base64-encoded queries (email+key auth) - NetlasSource searches Netlas API with X-API-Key header auth - BinaryEdgeSource searches BinaryEdge API with X-Key header auth - All three implement recon.ReconSource with shared Client retry/backoff Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 12:24:04 +03:00
salvacybersec	f5d8470aab	feat(12-01): implement Shodan, Censys, ZoomEye recon sources - ShodanSource searches /shodan/host/search with API key auth - CensysSource POSTs to /v2/hosts/search with Basic Auth - ZoomEyeSource searches /host/search with API-KEY header - All use shared Client for retry/backoff, LimiterRegistry for rate limiting	2026-04-06 12:23:06 +03:00
salvacybersec	bebc3e7a0b	test(11-03): add end-to-end SweepAll integration test across all 18 sources - Extend httptest mux with fixtures for Google, Bing, DuckDuckGo, Yandex, Brave - Add Pastebin (routed /pb/), GistPaste (/gp/), PasteSites (injected platform) - Assert all 18 SourceTypes emit at least one finding via SweepAll	2026-04-06 12:06:27 +03:00
salvacybersec	3250408f23	feat(11-03): wire 18 sources into RegisterAll + credential wiring in cmd/recon.go - Extend SourcesConfig with GoogleAPIKey, GoogleCX, BingAPIKey, YandexUser, YandexAPIKey, BraveAPIKey - RegisterAll registers 8 Phase 11 sources alongside 10 Phase 10 sources (18 total) - cmd/recon.go reads search engine API keys from env vars and viper config - Guardrail tests updated to assert 18 sources	2026-04-06 12:02:11 +03:00
salvacybersec	a53d952518	Merge branch 'worktree-agent-a27c3406'	2026-04-06 11:58:19 +03:00
salvacybersec	ed148d47e1	feat(11-02): add PasteSitesSource multi-paste aggregator - Aggregates dpaste, paste.ee, rentry, hastebin into single source - Follows SandboxesSource multi-platform pattern with per-platform error isolation - Two-phase search+raw-fetch with keyword matching against provider registry	2026-04-06 11:55:44 +03:00
salvacybersec	770705302c	feat(11-01): add DuckDuckGoSource, YandexSource, and BraveSource - DuckDuckGoSource scrapes HTML search (no API key, always enabled, RespectsRobots=true) - YandexSource uses Yandex XML Search API (user+key required, XML response parsing) - BraveSource uses Brave Search API (X-Subscription-Token header, JSON response) - All three follow established error handling: 401 aborts, transient continues, ctx cancellation returns	2026-04-06 11:54:42 +03:00
salvacybersec	7272e65207	feat(11-01): add GoogleDorkSource and BingDorkSource with formatQuery updates - GoogleDorkSource uses Google Custom Search JSON API (APIKey+CX required) - BingDorkSource uses Bing Web Search API v7 (Ocp-Apim-Subscription-Key header) - formatQuery now handles google/bing/duckduckgo/yandex/brave dork syntax - Both sources follow established pattern: retry via Client, rate limit via LimiterRegistry	2026-04-06 11:54:36 +03:00
salvacybersec	3c500b5473	feat(11-02): add PastebinSource and GistPasteSource for paste site scanning - PastebinSource: two-phase search+raw-fetch with keyword matching - GistPasteSource: scrapes gist.github.com public search (no auth) - Both implement recon.ReconSource with httptest-based tests	2026-04-06 11:53:00 +03:00
salvacybersec	118decbb3e	fix(phase-10): add --sources filter flag and DB persistence to recon full Closes 2 verification gaps: 1. --sources=github,gitlab flag filters registered sources before sweep 2. Findings persisted to SQLite via storage.SaveFinding after dedup Also adds Engine.Get() method for source lookup by name.	2026-04-06 11:36:19 +03:00

1 2 3 4

165 Commits