Troubleshooting Guide: Diagnosing Common Hardware and Software Issues

The Systematic Troubleshooting Approach

Random guessing wastes time and often makes problems worse. The CompTIA A+ troubleshooting methodology provides a repeatable, six-step framework that IT support professionals in the UK and worldwide use to diagnose faults efficiently. This methodology is also closely aligned with the ITIL incident management process used in many UK organisations.

The six steps are designed to be followed in sequence. Skipping a step, particularly the identification and documentation phases, is the most common reason that tickets are reopened or issues recur.

Identify the problem — Gather information from the user. Ask open-ended questions about what happened, when it started, and what changed. Check logs, error messages, and recent system changes. Replicate the issue if possible.
Establish a theory of probable cause — Based on your findings, form a hypothesis. Start with the simplest and most likely explanation first. Consider recent changes, environmental factors, and known issues.
Test the theory to determine the cause — Run targeted tests to confirm or eliminate your theory. If the theory is not confirmed, return to step two and form a new hypothesis.
Establish a plan of action and implement the solution — Once the cause is confirmed, plan your fix. Consider the impact on the user and other systems. Follow your organisation's change management process for anything beyond a simple repair.
Verify full system functionality and implement preventive measures — Confirm the fix resolves the original issue without introducing new problems. Apply patches, update configurations, or implement monitoring to prevent recurrence.
Document findings, actions, and outcomes — Record the symptoms, root cause, resolution, and time spent in your ticketing system. This builds a knowledge base that benefits the entire team.

CompTIA Troubleshooting Steps

70%

Issues Resolved at First Line

15 min

Average L1 Response Target

30%

Tickets From Repeat Issues

Why Systematic Beats Random

Industry data consistently shows that structured troubleshooting reduces mean time to resolution (MTTR) by 40% or more compared to ad-hoc approaches. When support engineers follow a systematic process, they avoid the common trap of swapping components or reinstalling software without understanding the root cause. This is especially important in ITIL-aligned UK service desks, where first-call resolution rates and SLA compliance are closely tracked KPIs.

Hardware Troubleshooting

Hardware issues account for a significant proportion of IT support tickets. The key to efficient hardware diagnosis is understanding what the system tells you during startup, recognising physical symptoms, and knowing when a component has genuinely failed versus when it is a configuration or connection issue.

POST Errors and Startup Failures

The Power-On Self-Test (POST) is the first diagnostic your computer runs every time it powers on. The BIOS or UEFI firmware checks critical components including the CPU, RAM, storage, and video output. If POST fails, the system communicates the problem through beep codes, LED patterns, or on-screen error messages. A single short beep typically indicates a successful POST, while repeated or coded beeps signal specific failures.

If a machine produces no display, no beeps, and no fan activity, start with the power supply. Verify the mains socket is live, check the power cable is fully seated, and test with a known-good PSU if available. In the UK, also check that the kettle lead (IEC C13) is not damaged and that any surge protectors or UPS units are functioning.

Common Hardware Symptoms and Causes

Hardware Symptom Diagnosis Reference

Symptom	Likely Cause	Recommended Action
No power at all	Faulty PSU, dead mains socket, blown fuse in plug (UK 13A fuse)	Test mains socket with another device; swap power cable; check fuse in UK 3-pin plug; test with known-good PSU
Powers on but no display	RAM not seated, GPU failure, monitor cable loose, faulty monitor	Reseat RAM modules; try onboard graphics if available; test with a different monitor and cable
Repeated beep codes during POST	RAM failure, GPU not detected, motherboard fault	Note the beep pattern and look up the code for your BIOS manufacturer (AMI, Award, Phoenix); reseat or replace the flagged component
Random shutdowns or restarts	Overheating CPU or GPU, failing PSU, faulty RAM	Check CPU temperatures in BIOS or with monitoring software; clean dust from heatsinks and fans; run memory diagnostics
Clicking noise from storage drive	Mechanical hard drive failure (head crash or motor failure)	Back up data immediately if accessible; replace the drive; do not attempt to open the drive
Blue screen with WHEA_UNCORRECTABLE_ERROR	CPU or RAM hardware fault, overheating, firmware issue	Run Windows Memory Diagnostic; check CPU temperatures; update BIOS/UEFI firmware; test with replacement components
USB devices not recognised	Faulty USB port, driver issue, insufficient power on hub	Test device in different ports; try a powered USB hub; check Device Manager for errors; update USB controller drivers
Laptop battery not charging	Faulty charger, damaged charging port, battery end of life	Test with known-good charger; check charging port for debris or damage; review battery health in system diagnostics

Source: CompTIA A+ Core 1 (220-1101) Troubleshooting Objectives

Overheating: The Silent Culprit

Overheating causes more intermittent faults than many engineers realise. Dust accumulation in heatsinks, blocked vents from placement against walls, and dried-out thermal paste on ageing CPUs all contribute. In UK offices where machines often sit under desks near carpet, dust ingress is particularly common. Regular cleaning schedules and temperature monitoring can prevent a large proportion of hardware-related support tickets.

Software and Operating System Issues

Software problems are the most frequent category of IT support tickets. They range from straightforward application crashes to complex boot failures that require recovery tools. The key principle is the same as with hardware: identify what changed. Most software faults follow a recent update, installation, configuration change, or user action.

Boot Failures and Recovery

When a Windows machine fails to boot, the first step is to determine where in the boot sequence it stops. If the BIOS/UEFI loads but Windows does not, the issue is likely a corrupted boot loader, missing system files, or a driver conflict. Windows 10 and 11 include an Automatic Repair environment that launches after two consecutive failed boots. From this environment, you can access System Restore, Startup Repair, Safe Mode, and the Command Prompt.

Blue Screen of Death (BSOD)

A BSOD indicates a critical kernel-mode error that forces Windows to halt. The stop code displayed on the blue screen is your primary diagnostic clue. Common codes include IRQL_NOT_LESS_OR_EQUAL (usually a driver issue), KERNEL_DATA_INPAGE_ERROR (often a storage problem), and CRITICAL_PROCESS_DIED (a system service has failed). Always note the full stop code and any file name referenced, then check it against Microsoft's documentation.

Common Software Error Reference

Windows Error Codes and Resolutions

Error / Symptom	Common Cause	Resolution Steps
IRQL_NOT_LESS_OR_EQUAL	Faulty or incompatible driver accessing invalid memory	Boot to Safe Mode; roll back recently updated drivers; run Driver Verifier to identify the offending driver
KERNEL_DATA_INPAGE_ERROR	Bad sectors on storage drive or failing RAM	Run chkdsk /f /r from an elevated command prompt; run Windows Memory Diagnostic; check SMART data on the drive
0x80070005 (Access Denied)	Insufficient permissions for Windows Update or installer	Run the process as administrator; reset Windows Update components; check Group Policy restrictions
Application hangs (Not Responding)	Insufficient RAM, application bug, disk I/O bottleneck	Check Task Manager for resource usage; update the application; increase virtual memory if physical RAM is limited
DLL file missing or not found	Corrupted installation, incomplete update, malware removal	Run sfc /scannow to repair system files; reinstall the affected application; run DISM /Online /Cleanup-Image /RestoreHealth
Windows Update stuck or failing	Corrupted update cache, insufficient disk space, conflicting software	Clear the SoftwareDistribution folder; run the Windows Update Troubleshooter; ensure at least 20 GB free disk space
Slow performance after login	Too many startup programmes, fragmented HDD, malware	Review startup items in Task Manager; run a full antivirus scan; check for disk fragmentation on HDDs (not needed for SSDs)
Profile corruption (temporary profile on login)	Corrupted user profile registry entry or NTUSER.DAT file	Check the ProfileList registry key; create a new user profile and migrate data; repair the profile using Microsoft's guidance

Source: Microsoft Support Knowledge Base and CompTIA A+ Core 2 (220-1102)

The SFC and DISM Repair Sequence

When Windows system files are corrupted, the correct repair sequence matters. Always run DISM first, then SFC. DISM repairs the Windows component store (the source files that SFC uses for repair), while SFC repairs the active system files. Running SFC first on a corrupted component store can actually worsen the problem. The commands are: DISM /Online /Cleanup-Image /RestoreHealth followed by sfc /scannow. Both require an elevated command prompt.

Network Connectivity Troubleshooting

Network issues are among the most frustrating for end users and the most challenging to diagnose remotely. The OSI model provides a structured framework for working through network faults layer by layer, starting from the physical connection and working up to the application. In practice, most first-line support engineers work from the bottom up: check the cable, check the IP address, check DNS, check the service.

The Bottom-Up Diagnostic Approach

Layer 1 (Physical): Is the network cable plugged in? Is the link light active on the Ethernet port? For wireless, is the Wi-Fi adapter enabled and connected to the correct SSID? In UK offices with structured cabling, check both the wall socket and the patch panel connection in the comms cabinet.

Layer 2 (Data Link): Is the switch port active? Has the MAC address been learned? Check for VLAN misconfigurations, particularly if the machine has been moved to a different desk or floor.

Layer 3 (Network): Does the machine have a valid IP address? A 169.254.x.x address indicates DHCP failure. Check the DHCP server is reachable and has available leases. Verify the subnet mask and default gateway are correct.

Layers 4-7 (Transport to Application): Can you ping the default gateway? Can you ping an external IP address such as 8.8.8.8? Can you resolve domain names? If pings succeed but web browsing fails, check proxy settings, firewall rules, and application-specific configurations.

UK ISP and Infrastructure Considerations

When troubleshooting internet connectivity in UK business premises, be aware that many SMEs rely on BT Openreach infrastructure, even if their ISP is a different provider. Faults on the Openreach line must be reported through the ISP, not directly to Openreach. Leased lines and FTTP connections have different fault reporting paths. Always check your ISP's service status page before spending time diagnosing internal network issues.

Network Diagnostic Command Reference

These Windows commands are essential tools for any IT support professional diagnosing network issues:

ipconfig /all — Displays full IP configuration including DHCP server, DNS servers, MAC address, and lease information
ipconfig /release and /renew — Releases and renews the DHCP lease, useful when an address is stale or in the wrong subnet
ping [address] — Tests basic connectivity to a host; use ping 127.0.0.1 to test the local TCP/IP stack first
tracert [address] — Traces the route packets take to a destination, showing where delays or failures occur
nslookup [domain] — Queries DNS to resolve a domain name; useful for diagnosing DNS failures or incorrect records
netstat -an — Shows all active connections and listening ports; helps identify port conflicts or unexpected connections
ipconfig /flushdns — Clears the local DNS cache; resolves issues caused by stale or incorrect cached DNS entries
pathping [address] — Combines ping and tracert functionality with packet loss statistics at each hop

Common Network Issues and Fixes

Issue	Diagnostic Steps	Resolution
No IP address (169.254.x.x)	Run ipconfig /all to confirm; check DHCP server status; verify network cable and switch port	Release and renew DHCP lease; restart DHCP service; assign static IP temporarily if urgent
Can ping IP but not domain names	Run nslookup to test DNS resolution; check DNS server addresses in ipconfig	Flush DNS cache; correct DNS server settings; check DNS server availability
Slow or intermittent Wi-Fi	Check signal strength; scan for channel congestion; verify access point load	Move closer to access point; change Wi-Fi channel to avoid interference; update wireless drivers
VPN fails to connect	Check internet connectivity first; verify VPN credentials; check firewall rules for VPN ports	Ensure required ports are open (IKEv2: UDP 500/4500, SSL VPN: TCP 443); reinstall VPN client; check certificate validity
Network drive not accessible	Verify the file server is online; check DNS resolution of server name; test with IP address directly	Re-map the drive using the server IP; check SMB version compatibility; verify share permissions and NTFS permissions
Duplicate IP address conflict	Windows displays a duplicate IP warning; check DHCP reservations and static assignments	Release conflicting leases; correct DHCP scope to avoid overlap with static ranges; assign reservations where needed

Source: CompTIA Network+ (N10-009) and ITIL 4 Incident Management Practice

Printer and Peripheral Troubleshooting

Printer and peripheral issues are a daily reality for IT support teams. Despite advances in plug-and-play technology, printers remain one of the most common sources of support tickets in UK workplaces. Print queue jams, driver incompatibilities, network printing failures, and paper handling problems are all frequent occurrences.

Print Queue and Spooler Issues

When a print job gets stuck, it often blocks the entire queue for that printer. The Print Spooler service in Windows manages the queue, and restarting it is one of the most common fixes. Open Services (services.msc), find Print Spooler, stop it, clear the files in C:\Windows\System32\spool\PRINTERS, then restart the service. This clears all pending jobs and allows new ones to process.

Network Printing in UK Office Environments

Many UK offices use networked multifunction devices (MFDs) from manufacturers such as Ricoh, Konica Minolta, Xerox, and HP. These devices typically use print management software such as PaperCut or Equitrac for secure print release and cost tracking. When network printing fails, check that the printer has a valid IP address, the print server (if used) is accessible, and the correct printer driver is installed on the user's machine. Windows 10 and 11 often install generic drivers through Windows Update that may lack full functionality for enterprise MFDs.

Common Printer and Peripheral Issues

Problem	Likely Cause	Fix
Print jobs stuck in queue	Corrupted print job or spooler crash	Stop Print Spooler service; clear spool folder; restart service; resubmit the print job
Prints blank pages	Empty toner cartridge, clogged inkjet heads, wrong driver	Check toner or ink levels; run head cleaning utility for inkjets; verify correct driver is installed
Paper jams recurring	Worn pickup rollers, incorrect paper size loaded, damp paper	Replace pickup rollers; verify paper guides match paper size; store paper in dry conditions (important in UK humidity)
Printer offline despite being powered on	Network connectivity lost, IP address changed, driver set to offline mode	Ping the printer IP; check DHCP reservation; uncheck "Use Printer Offline" in printer properties
Poor print quality (streaks or fading)	Low toner, dirty drum unit, calibration needed	Replace toner; clean or replace drum unit; run printer calibration from the control panel
External monitor not detected	Faulty cable, incorrect input source, driver issue, display port incompatibility	Try a different cable; check monitor input source; update graphics drivers; use correct adapter (USB-C to HDMI or DisplayPort)
USB device constantly disconnecting	USB power management, faulty cable, driver conflict	Disable USB selective suspend in Power Options; try a different port or cable; update USB hub drivers
Dual monitors showing same image	Display mode set to "Duplicate" instead of "Extend"	Press Windows + P and select "Extend"; configure display arrangement in Display Settings

Source: CompTIA A+ Core 1 (220-1101) Printer and Peripheral Objectives

The Universal Printer Fix Sequence

When a printer issue does not have an obvious cause, follow this proven sequence: 1) Power cycle the printer. 2) Clear the print queue and restart the spooler. 3) Check physical connections (network cable or USB). 4) Verify the printer has a valid IP address and is reachable. 5) Remove and reinstall the printer driver. 6) Print a test page from the printer's own control panel to isolate whether the fault is with the printer or the computer. This sequence resolves approximately 80% of printer issues without escalation.

Escalation and Documentation

Knowing when and how to escalate is just as important as knowing how to fix things. In ITIL-aligned UK service desks, escalation is not a failure — it is a controlled transfer of an incident to a team with the appropriate skills, access, or authority to resolve it. Poor escalation, however, wastes time, frustrates users, and damages the reputation of the IT support function.

Escalation Tiers in UK IT Support

Level 1 (L1) — Service Desk: First point of contact. Handles password resets, basic troubleshooting, known-error resolution from the knowledge base, and ticket logging. Most UK organisations target 70% first-call resolution at L1.

Level 2 (L2) — Desktop Support / Technical Support: Handles issues that require deeper investigation, hands-on access to the device, or specialist knowledge. Includes OS reimaging, hardware replacement, software installation, and complex configuration changes.

Level 3 (L3) — Infrastructure / Specialist Teams: Server administration, network engineering, security operations, and application development teams. L3 handles root cause analysis, system-wide faults, and issues requiring vendor engagement.

Vendor / External Escalation: Hardware warranty claims, software vendor support cases, ISP fault reporting, and Openreach line faults. Always gather full diagnostics before engaging external support to avoid unnecessary delays.

Writing Effective Escalation Notes

The quality of your escalation notes directly affects how quickly the next tier can resolve the issue. A well-written escalation includes: a clear description of the symptom, the steps you have already taken, the results of those steps, any error codes or log entries, and the business impact. Avoid vague statements like "it is not working" — instead, describe exactly what happens and what you expected to happen.

Escalation Checklist

Before escalating any ticket, confirm you have completed and documented the following:

User details — Name, department, contact number, location, and asset tag of the affected device
Symptom description — Exact error messages, when it started, frequency of occurrence, and how many users are affected
Steps already taken — List every diagnostic step and its result, including any temporary fixes applied
Environmental context — Recent changes (updates, new software, office moves), other affected users, and related tickets
Business impact — Is the user unable to work? Is there a workaround? How many people are affected? Is there an SLA deadline approaching?
Logs and evidence — Attach screenshots, event log exports, diagnostic command outputs, and any relevant error codes
Priority recommendation — Based on impact and urgency, recommend a priority level in line with your organisation's ITIL priority matrix

Building a Troubleshooting Knowledge Base

Every resolved ticket is a potential knowledge base article. UK IT service desks that maintain an active Known Error Database (KEDB), as recommended by ITIL, consistently achieve higher first-call resolution rates. When you resolve a recurring issue, document the symptoms, root cause, and resolution steps in a format that any L1 engineer could follow. Over time, this transforms tribal knowledge into a shared organisational asset.

Good knowledge base articles follow a consistent structure: title (matching the symptom the user would describe), environment (which systems or software versions are affected), symptoms (what the user sees), cause (what is happening technically), and resolution (step-by-step fix). Include screenshots where they add clarity, and review articles quarterly to remove outdated content.

When to Escalate: Decision Guide

Scenario	Escalate To	Reason
Hardware failure requiring replacement	L2 Desktop Support	Physical access to the device needed; may require warranty processing or asset management
Server or service outage affecting multiple users	L3 Infrastructure	Server-side investigation and remediation; potential change management required
Suspected security incident	Security Operations / L3	Must follow your organisation's incident response plan; do not attempt to clean malware without authorisation
Network-wide connectivity failure	L3 Network Engineering	Switch, router, or firewall configuration issue beyond L1/L2 access and expertise
Software bug confirmed and reproducible	Vendor Support / L3 Application	Requires code-level fix or patch from the vendor; document exact reproduction steps
Issue unresolved after 30 minutes of L1 investigation	L2 Technical Support	SLA compliance; avoid keeping the user waiting while investigating beyond your current skill level

Source: ITIL 4 Foundation — Incident Management and Escalation Practice

Start Your IT Support Career

Mastering troubleshooting is the foundation of a successful IT support career. Our accredited IT Support course covers hardware, software, networking, security, and service desk skills aligned to CompTIA and ITIL frameworks, preparing you for real-world roles in UK organisations.

Explore Our IT Support Course

We value your privacy