Part B: Procedure to Execute an FMEA Analysis

Part B: Procedure to Execute an FMEA Analysis

Failure Mode and Effects Analysis (FMEA) provides a systematic methodology for analysing both adversarial and non-adversarial system failures. This procedure adapts classical reliability engineering techniques for modern cybersecurity applications.

Step 1: Define System Scope and Failure Context (15 minutes)

Establish the specific system or subsystem under analysis, such as “Document ingestion and RAG query pipeline.” Clearly define what constitutes a “failure” within the assessment scope, distinguishing between technical faults (service crashes), security failures (data exfiltration), and AI-specific failures (hallucinated outputs). This definitional clarity ensures consistent evaluation across the analysis team.

Step 2: Identify Failure Modes Through Causal Mapping (25 minutes)

For each critical system component or process, systematically identify potential failure modes by asking: “How can this component fail in a way that materially impacts the business?”

Document each failure mode using the causal chain format:

[Triggering Cause] → [Failure Mode] → [Business Effect].

 For example: “Malicious metadata in document → AI executes adversarial instructions → Sensitive data exfiltration.” This approach moves beyond simple component analysis to embrace systemic, pathway-based thinking essential for complex modern systems.

Step 3: Quantitative Risk Assessment (30 minutes)

For each identified failure mode, assign numerical ratings across three critical dimensions:

Severity (S): Rate the magnitude of business impact if the failure occurs on a 1-5 scale, where 1 represents negligible impact, and 5 represents catastrophic consequences, such as regulatory breaches or existential organisational harm.

Occurrence (O): Evaluate the likelihood of the failure mode materialising, considering current controls and threat landscape dynamics, where 1 represents a rare occurrence, and 5 indicates a frequent or highly probable event.

Detectability (D): Assess the probability that existing monitoring and detection capabilities will identify the failure before it causes business harm, where 1 indicates almost certain detection and 5 indicates scenarios very unlikely to be detected.

Step 4: Calculate Risk Priority Numbers (10 minutes)

Compute the Risk Priority Number for each failure mode using the formula:

RPN=Severity×Occurrence×Detectability

Higher RPN values indicate higher priority for mitigation investment. This mathematical approach provides objective prioritisation criteria and supports resource allocation decisions.

Step 5: Develop Mitigation Strategies (20 minutes)

For failure modes with the highest RPNs, develop comprehensive mitigation strategies encompassing technical controls (firewalls, encryption), architectural improvements (segmentation, least privilege), and process enhancements (human oversight, approval workflows).

Document the responsible owner, implementation timeline, and expected impact on the S/O/D ratings. Recalculate projected RPNs after implementing the mitigation to validate control effectiveness.

Step 6: Integration with Risk Management Processes (10 minutes)

Transform high-priority failure modes into formal entries in the risk register to ensure alignment with enterprise governance frameworks. Establish continuous monitoring requirements based on detectability insights, feeding detection-related findings into SIEM rules, metrics, alerting systems, and periodic validation testing such as red team exercises.


Scenario 2: AI/RAG - Legal Contract Analysis with Indirect Prompt Injection Risk

Context of the Problem

A global enterprise legal department has deployed an advanced Retrieval-Augmented Generation (RAG) system utilising Large Language Models to analyse confidential vendor contracts, intellectual property agreements, and attorney-client privileged communications.

 This system operates fundamentally as a probabilistic “black box” in which outputs are non-deterministic, vulnerabilities arise from training processes rather than explicit code flaws, and traditional penetration testing methodologies prove entirely ineffective.

The system architecture includes automated ingestion of external vendor contracts (often as PDFs with complex metadata), indexing into vector databases, and AI agent capabilities, including exporting contract summaries to vendor portals and creating internal tickets for unusual clauses. The organisation faces unprecedented risks, including indirect prompt injections, hallucinated legal precedents, and systematic algorithmic bias in contract risk assessment.

Advanced Risk Management

Buy nowLearn more
  • Course Motivation

0.0 Shifting from technical execution to strategic risk management.

  • The Strategic Imperative of the Security Function
  • IBM - Motivation for Risk Analysis in CyberSecurity (11 min)
  • Google - Security Frameworks (30 min)
  • Introduction: The Evolution of Security Management

1. Introduction to ISO/IEC 27005 and information security risk management

  • Introduction: The Evolution of Risk Management Standardisation
  • International Standardisation: ISO 31000 versus ISO 27005
  • The ISO Risk Management and other frameworks
  • The Psychology of Risk Perception and Decision-Making
  • The ISO 31000 Architecture: Principles, Framework, and Process
  • Review of Risk Assessment Methodologies (IEC 31010)
  • Scope, Context, and Criteria
  • Leadership, Governance, and Corporate Commitment
  • Quiz01 - Risk Management [Day01]

2. Information Security Risk Identification, Assessment, and Treatment (ISO/IEC 27005)

Delayed 1 day

  • Identification and description of information security risks
  • Identification of risk owners
  • Assessment of potential consequences
  • Determination of risk levels
  • Comparison of risk analysis results with established risk management criteria
  • Risk prioritization
  • Determination of required controls for risk treatment
  • Risk treatment plan
  • Quiz02 - Risk Identification, Assessment and Treatment [day2]

3 - Risk Acceptance, Communication, Monitoring and Review

Delayed 2 days

  • Key Take aways (Module 01 - Module 02)
  • Quiz03 - Recap - Session 1 & 2
  • Communication and Consultation of Results
  • Documentation of the Risk Analysis Process
  • Documentation of Results
  • Monitoring of Risk-Generating Factors
  • Deep Dive: Navigating Complexity with ISO/TS 31050 and the Risk Intelligence Cycle
  • Future-Looking Challenges for Risk Management and ISO/IEC 27005

4 - Risk Assessment Methodologies

Delayed 3 days

  • The Methodological Shift: Transcending Traditional Frameworks
  • Technique 1: STRIDE for Cloud and PaaS Architectures
  • Technique 2: Subjective Evaluation of Opaque AI Risks
  • FMEA, Red Teaming, and Risk Register Integration
  • Risk Monitoring Processes
  • Part B: Procedure to Execute an FMEA Analysis

05 - ISO 27005 Risk Assessment Using FMEA

Delayed 4 days

  • Process Overview - Lab: AI and Cloud Services
  • Quiz - Simulation exam
  • Quiz - summary