Skip to content

Discovery

Scheduled Discovery is our internal team’s proactive system for automatically hunting for threats related to your organization across the web. Think of it as an “automated search party” that we manage on your behalf. You don’t use this tool yourself, but you see the results: a steady stream of verified, relevant threats delivered directly to your dashboard.\n

  • The Process from Web to Your Dashboard:\n

    1. The Starting Point: Your Assets: The hunt begins with your organization’s digital assets, such as your domains, brand names, and IP addresses. These act as the keywords for our search.\nFind more details in @Asset Management
    2. The Hunt: Automated Scanning: Our team schedules specialized scripts to run at regular intervals (e.g., daily or weekly). These scripts automatically scan the web, searching for potential risks connected to your assets.
    3. The Human Touch: Verification & Classification: Every potential threat found by the automated scans is manually reviewed by our security analysts. They first confirm the threat is real and relevant to you, filtering out all false positives. They then categorize the verified threat (e.g., ‘Phishing Website,’ ‘Data Leak,’ ‘Malicious Mobile App’).
    4. The Result: Actionable Intelligence: Once verified and classified, the detection is sent to your dashboard. You receive a clean, contextualized alert that you can trust, saving you the time and effort of investigating raw findings.

\

Discovery runs automatically at set times. Each time it runs, it looks at a starting list of items (called assets), performs steps on them (called flows), and records anything it finds (called detections). Sometimes those findings create new things to look at next. It keeps doing this in layers until it reaches a limit.

WordPlain Meaning
ScheduleWhen it should run (repeat plan)
RunOne execution of the process
WorkerThe part that does the work
FlowOne defined step the system performs
PipelineThe ordered list of flows
AssetA starting item
TargetAn item being processed right now
DetectionA result produced by a flow
DepthLayer number (first pass, second pass, etc.)
CheckpointSaved progress so it can resume
Audit LogHistory record of what happened
  1. Run starts when the schedule time arrives.

  2. Loads starting assets and which flows to apply.

  3. Turns assets into targets to process.

  4. Runs each flow over the targets in order.

  5. Stores detections (findings) as they appear.

  6. May create new targets for the next depth (next layer).

  7. Repeats until depth limit or a stop condition.

  8. Marks final status (done, paused, failed, timeout, etc.).

    \

Start → Load → Build targets → Run flows → Record detections → Generate next targets → Increase depth → Repeat or stop → Finalize.

During the run it saves:

  • Current depth
  • Which target it is on
  • Which flow it is in
  • What has been found so far

If interrupted, it can continue later without redoing finished work. Currently these saved snapshots are internal only (not shown live in the user interface).

  • ACTIVE = Will keep starting runs.
  • INACTIVE = Temporarily off.
  • ONGOING = A run is happening now.
  • ENDED = Stopped permanently.
  • PAUSED = Manually paused.
  • FAILED / TIMEOUT_FAILED / TIMEOUT_TERMINATED / STUCK = Stopped because of a problem or time limit.

SCHEDULED = Waiting to start. RUNNING = In progress. COMPLETED = Finished normally. FAILED = Ended with an error. TIMEOUT = Took too long and was stopped.

  • PENDING = Not started.
  • INPROGRESS = Working now.
  • SUCCESS = Finished okay.
  • ERROR = Failed.
  • PAUSED = Manually paused.
ReasonWhat Happens
Manual pauseStops and can be resumed
Flow errorStops or marks failure
Time limit reachedForce stop
System issueLater resume from checkpoint

On resume it reads the last checkpoint and continues from there. Already completed parts are not repeated.

Flows can send small internal updates (percent, counts, notes). These help internal tracking. They are not currently displayed live in the interface.

You can see new detections as they are produced. You cannot currently see: live percent, which target is running, depth progress, or internal checkpoints.

Success = All planned layers done. Paused = Stopped on purpose and can continue. Failed / Timeout = Stopped early; may restart using last checkpoint.

AreaCurrent State
Can resumeYes, via checkpoints
See live progressNot yet
See detectionsYes, as they appear
Control depthYes
Recover from stopYes
Full transparencyPartial (some internal data hidden)

Discovery, Run, Worker, Flow, Pipeline, Asset, Target, Detection, Depth, Checkpoint, Audit Log.

Internally it watches for long inactivity. If limits are exceeded it may mark STUCK or TIMEOUT. To investigate now you must look at logs (UI does not show live internal steps yet).

16. Planned Future Improvements (Not Live Yet)

Section titled “16. Planned Future Improvements (Not Live Yet)”
  • Live depth and target counters
  • Real-time percent progress
  • Visible checkpoint timeline
  • Per-flow timing display
  • Clear resume reason display

These short examples show when and why you might change settings or take action.

  1. Improve data quality by increasing depth
  • Situation: Results at the current depth feel incomplete or lack context.
  • Action: Increase depth by 1 or 2. This lets Discovery follow additional leads and produce more related findings.
  • Result: More detections and richer context for each original item.
  1. Save time and resources by lowering depth
  • Situation: Runs take too long or produce many irrelevant items.
  • Action: Lower the depth to limit how far Discovery expands.
  • Result: Faster runs and fewer findings to review.
  1. Narrow results by changing flows
  • Situation: You see many unrelated detections.
  • Action: Turn off flows that produce noisy results, or enable only flows that match your needs.
  • Result: Cleaner, more relevant findings.
  1. Handle interruptions safely
  • Situation: A run stops unexpectedly (system issue, manual stop).
  • Action: Ensure checkpoints and resume are enabled. Restart the run from the last checkpoint.
  • Result: You continue from the same place without redoing completed work.
  1. Investigate a single item more deeply
  • Situation: One important asset needs extra investigation.
  • Action: Run a one-off job for that asset with a higher depth and specific flows.
  • Result: Focused, deeper data for the item of interest.
  1. Tune for performance during busy times
  • Situation: System or network heavy during business hours.
  • Action: Schedule runs during off-peak hours and reduce depth temporarily.
  • Result: Lower impact on regular operations.
  1. Iterative approach to discovery
  • Situation: New to a dataset and unsure what depth or flows are best.
  • Action: Start with depth 0 or 1 and a small set of flows. Review results, then incrementally increase depth or add flows.
  • Result: Controlled exploration and quicker learning about what yields useful findings.
  1. Reduce noise after a broad run
  • Situation: A broad run produced many detections, some irrelevant.
  • Action: Narrow flows and run targeted lower-depth jobs on specific subsets of assets.
  • Result: Cleaner results and less review work.
  1. Quick checks versus deep scans
  • Situation: You need a fast health check or a deep analysis.
  • Action: Use low depth and few flows for fast checks; use higher depth and more flows for deep scans.
  • Result: Flexible balance between speed and detail.