ATD360 | Advanced Threat Detection

Graph ATD360

Helps large organizations detect, analyze, and remediate advanced threats.

ATD360 is the ultimate weapon for big organizations to detect advanced threats! With its sleek log system, graph visual mode, and threat hunting mode, it makes spotting security threats a breeze. The best part? It provides profiles for all files, connections, registry key values, users, tasks, software, and more! ATD360 has got your back and is the perfect partner to keep your organization secure.

I’ve been working at Graph for 3 years!

My Role

Year 1: ATD Interaction Designer

Year 2 & 3: ATD360 Product Designer

Overall Impact

Total redesign of ATD and creation of ATD360

Combined 4 products into one unified platform

Designed and managed an in-house design system named “Graphix”

My Impact

Total redesign of ATD: Research, Data Analysis, Interaction Design, User Flows, Tests

Created a totally new platform and a new workflow: ATD360 combined 4 related but separate products

Background

The Problem

The system generated thousands of security alarms and events. It was impossible for the users to examine all of them, so they sought an automatic pattern detection system.

💡Why ATD360?

The new design, ATD360, was a complete overhaul of the previous ATD. Our team redefined the workflow for our users, who had been analyzing thousands of alarms to identify patterns and potential attack flows, and we automated many of those steps to streamline the process. Additionally, our users worked with four different products, each with its own panel and a lot of information to share. Despite this separation, we found ways to connect the products and improve communication between them.

MY ROLE

Product Designer


I Designed an Escape Room From Scratch to Run!


⚠️ SPOILER ALERT!!!!!

If you want to play DotSawOut, don’t continue reading! This doc will spoil everything!

Intro

If I had to pick a job other than product design, I would choose game design without a doubt! Game design has always been a passion of mine, and I spend a lot of my free time studying and learning more about it. Luckily, as a big fan of escape rooms here in Iran, I got the opportunity to design a whole escape room experience – from the storyline and puzzles to the gameplay, atmosphere, and acts. It was an amazing experience that allowed me to combine my skills as a product designer with my love of game design. In this case study, I’ll share my journey as both a product designer and a budding game designer.

500 Escape rooms!

As a self-proclaimed escape room geek, I’ve had the pleasure of playing nearly every escape room in Iran. And let me tell you, it’s a thriving industry here! Despite what some might think, there are countless escape rooms to explore, and we’re proud to say that some of the most creative designs come from our very own country. In fact, there are over 500 escape rooms spread across different cities in Iran. After experiencing so many different games, my friends and I decided to take on the challenge of designing our very own escape room. That’s when I dove headfirst into game design courses and books to learn as much as possible.

Prologue

At the beginning of our project, we noticed that escape rooms were becoming saturated with horror themes. So, we decided to shake things up and choose a different genre altogether – Sci-Fi! Our story, DotSawOut, is set in the year 2090 and follows a physicist, Dr. Haywood, who takes it upon himself to launch a rocket to destroy a meteorite hurtling toward Earth. Why, you ask? Because NASA had rejected his and his team’s theory on how to save the planet from certain doom. Unfortunately, Dr. Haywood’s project fails, leading to the tragic deaths of millions in the eastern hemisphere. Three months later, NASA receives a signal from the East, and that’s where you come in! As players, you’ll act as agents sent to investigate the source of the signal.

Where did we start?

When it came to creating our escape room, we knew that the key to success lay in the Experience Design. Before even delving into the story or characters, we asked ourselves: what experience do we want the players to have? With a wealth of ideas at our fingertips, we ultimately settled on a Time Travel experience – something that was completely unique at the time. From there, we began fleshing out the idea by adding mechanics and dynamics and structuring the gameplay. We designed missions, quests, and puzzles to create a fully immersive and engaging experience for players.

Storyline

“Why do they need to travel in time?” you might ask. Well, in our game, players must travel through time to prevent a disastrous event from occurring. As they begin their journey as NASA agents investigating a strange signal, they encounter a damaged robot that has lost its memory. To help the robot remember what happened, players must use a time machine found in Dr. Haywood’s workspace. This machine allows players to journey back in time and prevent the explosion of Dr. Haywood’s rocket, thus averting a catastrophe that could spell doom for humanity. It’s an exciting and thrilling adventure that challenges players to think on their feet and use their skills to save the day.

 

 

After discovering Dr. Haywood’s notes, players will be tasked with going back in time to destroy the resources he used to create the catastrophic rocket. Each time the time machine is used, players will be transported to a new location, providing an exciting and unpredictable adventure. We truly thought out of the box in designing our game. In one mission, players must even destroy one of Da Vinci’s inventions, ensuring that nothing ever flies! With every twist and turn, players will be challenged to think creatively and use their problem-solving skills to navigate the game’s complex and intriguing storyline.

Gameplay: Main Mechanics

Since we wanted to build a time travel experience, we needed to create a time machine, with all its mechanics, that players would want to interact with in order to travel to another time. And because the game includes two past locations and one future location, we applied the same experience design approach to each of those settings as well.

This Case Study is being updated….

Zylem Process Mining: A Deep Product Adventure


Intro

What is process mining? That was my question too when I joined Zylem as a UX designer. Essentially, it’s a way to analyze data from various sources like IT systems and databases, to identify patterns and optimize business processes. It’s like putting on a pair of X-ray goggles for your company’s operations – you can see everything that’s happening in your processes, from start to finish.

Zylem is an intelligent process mining software that differentiates itself from other similar products like Celonis, Minit, and Pafnow. As the first team member after the product manager, I dove into the product and created a fast MVP to test.

As a semi-game designer, I love adventure. That’s why I was excited to explore the world of process mining. I’ve worked with neural networks before, so I had a basic understanding of data mining, but process mining was a new challenge for me. In this case study, I’ll take you on a journey through our product development process, which turned out to be quite an adventure. So buckle up and get ready to learn!

The Map

When I joined the Zylem team, I realized I needed to learn the ins and outs of process mining. It’s a relatively new field, so there weren’t many sources available. But lucky for us, we had not one, not two, not three, but four academic professors as product advisers. They provided me with recorded process mining classes from Germany, and I eagerly began my studies. Think of it as my very own Indiana Jones adventure, except instead of searching for treasure, I was searching for process-mining knowledge.

 

After ten days of intensive studying, I finally felt ready to begin the benchmarking process. There weren’t many products for process mining on the market at the time, with Celonis being the biggest one. So, I decided to conduct a light benchmark on their IA and core features. And you know what? It was during this process that I realized why Zylem’s core values could win the race! It’s like we had our very own secret weapon, except instead of a weapon, it’s a set of values that help us stand out in the crowded process mining market.

The Map Legend

You may be wondering what our core value was. Well, it was the Zylem Connector! During the benchmarking phase, I focused on understanding the main features of a process mining tool and how they were being developed. However, I still didn’t have a clear understanding of our users, the problem we were solving, the time to market, or our plans for expanding the product. That’s why I conducted stakeholder interviews to get some answers. Here are some of the questions I wanted to address:

  • Who exactly is our target user? (Of course, we needed to conduct user research to confirm our assumptions, but I wanted to hear the stakeholders’ ideas first.)
  • What problem are we solving, and why should a company use Zylem instead of Celonis?
  • How are we solving this problem?
  • What’s our vision for the future?
  • What’s the size of the market we’re targeting?

 

After a brief discussion with the stakeholders about our target users, I asked our product advisers to connect me with current users of Minit and Celonis. This user research was started early in the process, even before we began the main development. Why? Because I was new to the domain and needed to understand the users of these types of products better, despite the courses I had taken.

📑 First User Research: Who are you!?

My goal was to conduct exploratory research in a short period of time, just to get a better understanding of users in the domain and to validate the stakeholders’ ideas. This data wasn’t going to be used directly to make decisions about the features or product structure.

 

I conducted interviews with three users of Minit and three users of Celonis. I was a bit disappointed with the number of participants, as I wasn’t sure if the data would be insightful or reliable if each participant had a different pain point. It was a tough situation for me, full of uncertainty.

 

To start the research, I created a screener document and got approval from the stakeholders. It was challenging to convince them to spend even three days on user research, as they were eager to start the product as soon as possible due to our marketing team’s strict time-to-market requirements. But after three days, I was the happiest designer ever! Why, you ask? Because five out of six participants mentioned a main pain point that I had already heard about in the stakeholder interviews.

 

I wrapped up my findings, created an affinity diagram to keep track of other pain points, and started building a persona skeleton.

The Monster in the Mountains

In the realm of process mining, there once lurked a daunting beast known as the “Connector”. Allow me to elaborate. Process Mining software requires an input file known as an “Event Log”, which follows a standard template across all products. This file contains crucial information such as case ID, activity, time stamp, and optional columns like duration and income. However, many companies lacked this type of file or table in their databases, with their event logs scattered across different tables. For example, support center logs would be in a separate table from sales department logs, and income/outcome data would be stored in a completely different table. As a result, when they purchased process mining products, engineers had to spend two to three weeks on-site organizing the data and addressing data security and privacy concerns before the event log system was finally ready for use.

 

Not exactly a seamless process, to say the least. Fortunately, Celonis has developed some nifty solutions like templates and connectors for various databases to streamline the process. Nonetheless, the engineers still need to make their way to the company to get their hands on the data and whip it into shape. All in a day’s work for the Connector-slayers of the process mining world!
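
For reference, here is a minimal sketch in Python of what an event log of the kind described above might look like and how a process mining tool reads it. The column names and rows are a toy example of my own (assuming a pandas-style table), not Zylem’s actual schema.

```python
import pandas as pd

# A toy event log with the standard columns described above:
# a case ID, an activity, a timestamp, and an optional duration.
# All names and values here are illustrative, not real product data.
event_log = pd.DataFrame(
    [
        ("C-001", "Create Ticket",  "2022-03-01 09:00", 5),
        ("C-001", "Assign Agent",   "2022-03-01 09:05", 30),
        ("C-001", "Resolve Ticket", "2022-03-01 09:35", 0),
        ("C-002", "Create Ticket",  "2022-03-01 10:00", 12),
        ("C-002", "Resolve Ticket", "2022-03-01 10:12", 0),
    ],
    columns=["case_id", "activity", "timestamp", "duration_min"],
)
event_log["timestamp"] = pd.to_datetime(event_log["timestamp"])

# Process mining tools reconstruct each case as an ordered trace of activities.
traces = (
    event_log.sort_values("timestamp")
    .groupby("case_id")["activity"]
    .apply(list)
)
print(traces)
# Each case becomes an ordered list of activities,
# e.g. C-001 -> [Create Ticket, Assign Agent, Resolve Ticket]
```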

Secret Weapon: Zylem Connector

Our tech team has been hard at work building a secret weapon that can slay the monster in an instant! We’re talking about an AI that can learn from your database and automatically organize and create an event log file, regardless of the database type or how your data is populated.

 

 

As part of the solution, my role was to create a user flow to provide the information that our neural network needed. Since there was no engineer on-site and the code wasn’t running on our server, there were no privacy issues to worry about. The AI, of course, needed some input information to get started on its work.

Experience Tree: User Flow

I had a clear understanding of the problem statement from the get-go. We needed to gather information from the user to create their customized connector. However, there were some important requirements that I needed to keep in mind while designing the user flow:

 

  • Since there was a lot of information to gather, the flow had to be as smooth as possible.
  • Many users have found other process mining products to be overly complicated and not user-friendly, so we needed to make sure our solution was easy to use.
  • There are different types of users with different permissions when it comes to using process mining products.
  • Companies use different types of databases, so we had to account for this in our design.
  • Finally, we wanted to make sure that users felt confident that Zylem was the right product for them.

The Solution: 3-Step Data Connector

I’ve been having some really informative meetings with our tech team lately to better understand how our model works and determine what information we need to input. The model is quite complex, with a lot of dependencies and a lot of going back and forth. However, I was able to make a valuable contribution by creating a step-by-step process that makes it easy for users to provide all the necessary information without feeling overwhelmed. This IA has been a helpful guide for us and has streamlined our workflow.

The project was vast and intricate. To begin, I had to organize the information from scratch and break it down into coherent groups of steps for the user to follow. Fortunately, there were certain aspects that could be concealed from view until they were needed, which helped to simplify the process. All in all, it was a challenging endeavor that required a lot of effort, but I’m proud of the end result.

Then I needed to prioritize these groups based on the previous research data and our tech team’s concerns. I ended up with a user flow like this:

The first flow detailed everything I wanted to happen, while the second flow was designed to present users with grouped information. So, what exactly were these groups? Let me explain how I approached the user interface (UI) design.

 

 

The ultimate solution I came up with was a three-step data connector that could connect to databases of any complexity. Although there were many states and pages behind the scenes, I made sure that the user could see the easiest and most intuitive steps at first glance.

Flow Start: Resources

Our AI can create an event log by merging files or databases of different types that users can add in any combination. To simplify the process, I designed it to start with just the first resource and then users could add more later. Once they complete the first step, they can learn more about how the flow works and add additional resources as needed.

 

While considering different approaches, one idea was to allow users to add all resources simultaneously. However, during the Sketch phase, we decided to reject this idea because each resource type has its own flow. Managing all of the data from various resources in a single step would have made the process unnecessarily complex.

Here, I’ll walk through one sample file flow.

First Step: File Upload

Users have the ability to add multiple files, with each file being considered a separate resource. To simplify the process, all files can be added in a single step because they follow the same flow. Users can upload files from a URL or their local host, and a preview of their table is available to review before uploading.

 

This approach streamlines the process for users, allowing them to add multiple files at once while still providing a preview of each file’s contents before they are uploaded.

To streamline the process, I grouped all the file-related features together in the IA, so that users can perform all actions related to uploaded files in a single step. This includes managing files, editing them, removing them, and adding new ones.

 

By consolidating these features into a single step, users can easily navigate and manage their uploaded files without having to switch between different sections of the UI.

Second Step: Event log Creation

This is where the magic happens! Our secret weapon needs ammunition, obviously. Users need to specify what type of event log they want, and they can customize columns to achieve their desired event log.

The second step is the most complicated one, and it needed a layer-based data visualization. What do I mean? Well, we wanted the Event Log group defined in the IA during the previous phase to support the following features (a hypothetical configuration sketch follows the list below):

  • Select a template.
  • Edit columns based on what they want to monitor.
  • Edit the result based on different resources given to the algorithm.
  • Remove, Add, Edit, or change the type of a column.
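
To make the idea of templates and editable columns a bit more tangible, here is a purely hypothetical sketch of the kind of column configuration this step could assemble. Every field name and data source below is an assumption for illustration only; none of it comes from the actual product.

```python
# Hypothetical event-log configuration assembled in the second step.
# Every field name, source, and value here is made up for illustration.
event_log_config = {
    "template": "support_tickets",  # chosen from a list of ready-made templates
    "columns": [
        {"name": "case_id",   "source": "crm.tickets.id",         "type": "string"},
        {"name": "activity",  "source": "crm.tickets.status",     "type": "category"},
        {"name": "timestamp", "source": "crm.tickets.updated_at", "type": "datetime"},
        # Columns can be added, removed, or retyped before the data is loaded.
        {"name": "income",    "source": "sales.orders.amount",    "type": "float"},
    ],
}


def column_names(config: dict) -> list[str]:
    """Return the columns the resulting event log will contain."""
    return [col["name"] for col in config["columns"]]


print(column_names(event_log_config))
# ['case_id', 'activity', 'timestamp', 'income']
```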

 

The challenge was managing all these features and showing each one exactly when it was needed. That’s where the detailed version of the user flow (mentioned before) was helpful, and I used it to achieve something like this:

After all the tuning, the adventure starts…

Third Step: Load Data

The final step was checking the results and loading the data. Simple as possible.

This Case Study is being updated….

ATD360: The Spaghetti Product!

Graph ATD360

This One Helps You With: User Flows, Data Analysis, Information Architecture, and Research Method Selection ✨

ATD is a highly sophisticated web application designed for security operation centers. Its primary function is to detect advanced network threats and generate an alarm for each set of events that match our pre-defined rules. The end-users of the application are security specialists who analyze alarms on a daily basis and make decisions based on the level of threat detected. When I joined the team, I was faced with the daunting task of tackling the complexity of this "Spaghetti Product".

A header image from graph attack flow

Intro

“Graph” is a medium-sized enterprise with four distinct products: FAM (File Analysis Module), EDR (Endpoint Detection and Response), ATD (Advanced Threat Detection), and SandBox. While each product is sold separately, they are all interrelated and require data sharing to function effectively.

 

For instance, suppose a security specialist receives a threatening alert on ATD, indicating that a malicious file has been uploaded to their mail server. The next step would be to analyze the file using FAM to determine if it is indeed infected. If the file is not detected, the behavior can be manually explored using Sandbox. From there, the specialist may need to search for other instances of the file beyond the mail server and remove it from all agents containing the file with the help of EDR.

 

Without a system of interconnected products, managing these communications manually would be an arduous and time-consuming task.

To comply with my non-disclosure agreement, I have omitted and obfuscated confidential information in this case study.

"Profiles" Feature

THE CHALLENGE

My initial observation from the Marketing team was that the product was not user-friendly without the assistance of the Graph support team. This was evident as even our colleagues struggled to work with the application, especially when communicating across the four products.

 

To address this issue and regain lost customers, our objective with the new platform was to incorporate all features of the four products within two years, without the need for support assistance. Our high-level goals for this initiative included:

  1. Developing an intuitive platform that is easy to use and does not require support from our team.
  2. Integrating the features of all four products into a single solution.
  3. Enhancing the flow of data between components, thereby eliminating any disruptions.
The previous product was developed solely by developers without the involvement of a dedicated designer.

My Role

When I first joined the team, there was no designated designer. The ATD product had been created solely by developers without any input from other roles, leaving me to take on the responsibility of the entire design process.

 

During the early stages of development, I served as a researcher, delving deeper into the problem than the marketing team had done previously. As the project progressed, I took on the additional roles of interaction designer, UI designer, and test manager, all while continuing to conduct research. Despite the numerous responsibilities, I am proud to say that I was able to achieve a significant amount.

The KickOff

At the onset of the project, I encountered a significant challenge in defining a clear mission and specific goals for the new platform, given its vast scope. To better understand the user requirements and how our support team could assist them, I conducted contextual inquiry research with eight participants who were experts in SOC operations. This research allowed me to gain insights into their current product usage behaviors and mental models.

 

The research uncovered several key findings, including:

  1. The main section for data was deemed useless by most users, who instead relied on the forensic feature, resulting in the need for multiple open Chrome tabs and difficulty navigating between them.
  2. Nearly 70% of users were not aware of the full capabilities of our product due to the lack of an automation feature. As a result, they had to manually search for key values and related hosts for every alarm generated, while our backend could have automated this process.
  3. Users struggled to understand the flow of threats and attacks despite this being the product’s core value as intended by the development team.
  4. The asset management feature was not satisfactory, and the EDR module lacked essential data that was not being sent to the UI.

The Process

After the initial research phase, stakeholders expressed a preference for a complete redesign of the product, but due to production constraints, this was not feasible. As a result, I developed a customized design process for our new team with a focus on several key objectives:

  1. Ensuring the maintenance of our current selling products.
  2. Defining our primary persona.
  3. Restructuring the scenarios based on the daily work process of our persona.
  4. Prototyping new solutions for the new IA.
  5. Testing the new solutions.

As part of this process, my primary responsibility was to focus on the development of the new all-in-one product, ATD360. We recognized the need for a dedicated designer to work alongside our team to address current usability issues during the maintenance phase.

Discovery

🎯 First Attitudinal Research: What Are We Solving?

To better understand the needs of our users, I initiated a research process consisting of one-on-one interviews with 10 participants over the course of one week. As a first step, I created a screening document in collaboration with the product manager to identify and select suitable participants, and planned the logistics for the interviews including scheduling and location arrangements.

 

 

During the interviews, which were conducted in a hybrid format, I prepared a list of targeted questions and topics to discuss with participants, tailored to their daily work processes. Following the interviews, I analyzed the data and generated an affinity diagram to better visualize and understand our users’ needs, ultimately leading to the creation of our first user persona.

What They Said

  1. Checking alarms manually was a pain point for them as it was time-consuming and prone to errors.
  2. They needed a system that could group similar alarms together and provide them with relevant information at once.
  3. They wanted a way to prioritize alarms based on severity and potential impact.
  4. The lack of automation was causing them to miss some critical threats and they wanted a more proactive system that could alert them in real-time.

Based on these findings, we started to develop a new solution that focused on automating the alarm management process, grouping similar alarms together, and providing contextual information to SOC analysts in a more efficient way. We also prioritized the severity of alarms and developed a system that could alert analysts in real-time when critical threats were detected.

🌀 Second Attitudinal Research: Why Are They Selecting This Workflow?

During my research, I sought to gather domain-specific knowledge related to network security, so I concurrently reviewed a university security textbook to better understand the challenges teams face in the absence of ATD. In the second week of the research phase, a significant breakthrough was achieved. Through careful analysis, it became apparent that the key to understanding the issue lay in identifying the relationships between alarms. Specifically, the security team was manually cross-referencing alarm values to determine whether or not there was a connection between them. For instance, if an ATD-generated alarm indicated a malicious file, the team would investigate whether that file was present on other hosts or identify its origin. By discerning the relationships between alarms, numerous questions could be answered. To our surprise, the SOC team was already using this technique with our product, although we had neither planned nor implemented it as a feature.

Insights

Upon obtaining new insights, I initiated the creation of a redesigned information architecture (IA) for our forthcoming all-in-one platform, ATD360. To achieve this goal, it was imperative to establish frequent and robust collaborations with our engine, back-end, and data analysis teams. Our main challenge was to devise a new methodology for extracting relational insights from the data. Our existing data structure was complex and disorganized, making it difficult to identify patterns and extract meaningful information. To overcome these obstacles, we followed the following steps in close collaboration with our engine, back-end, and security data analysis teams:

    1. Reframed the problem with the new user flow in mind by identifying the types of relationships between alarms in real attack scenarios, determining which key values were useful in different alarms, and discerning which data sections were being used manually in actual attack scenarios.
    2. Created a new IA based on our current data structure on the back-end and engine side. This proved to be the most challenging phase of the design process.
    3. Shaped the user flows based on the updated Persona behavior to ensure a seamless and effective user experience.

The New IA

Initially, I had numerous ideas racing through my mind, and I struggled to discern which would be most effective in crafting a compelling relationship presentation within alarms. I asked myself, is it feasible to automate the process, or is it best suited for a human touch?

To begin, I thoroughly examined 120 genuine alarm data sets and drafted a comprehensive alarm data structure that incorporated both crucial and less significant fields and key values. By selecting alarms that had been generated at various times and on different dates, I ensured that no bias would be introduced.

Next, I identified the most prevalent attack scenarios and closely scrutinized the essential fields and key values of each alarm that was associated with a particular scenario. This method was similar to the daily workflow of SOCs utilizing our alarms.

Ultimately, this approach yielded a set of crucial key values that were pervasive across all alarms and repeatedly appeared in the majority of attack scenarios.
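
As a rough illustration of that analysis (not the actual data or tooling), counting how often each field appears across a set of alarms is enough to surface the key values that show up everywhere:

```python
from collections import Counter

# Illustrative only: a few alarms reduced to the set of fields they contain.
# The real analysis covered roughly 120 alarms with far richer structures.
alarms = [
    {"host", "file_hash", "src_ip", "rule_id"},
    {"host", "src_ip", "dst_ip", "user"},
    {"host", "file_hash", "user", "rule_id"},
]

# Count how often each key value appears across alarms; fields present in
# (nearly) every alarm are candidates for grouping and for showing relations.
field_counts = Counter(field for alarm in alarms for field in alarm)
common_fields = [field for field, count in field_counts.most_common() if count == len(alarms)]
print(common_fields)  # ['host']
```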

🛠 Third Attitudinal Research: How Can I Find The Pattern?

Firstly, I decided to conduct card-sorting research to identify how our key values could be grouped. The results of the card-sorting exercise proved to be invaluable after the initial data analysis. I discovered some key values that were common among the most important alarms, but the challenge remained – how do we present these insights to our Persona in a meaningful way?

Our solution was to group the key values and create a new flow to address the problem statement identified in the early phases of our research. To validate our solution, I ran a card-sorting session with eight participants selected to match our Persona. The session was a success, and I was able to identify the first semantic grouping of key values between alarms. This was a glimmer of hope in the darkness, and I felt confident that I was finding the elementary pattern to show the relation between alarms.

After the card-sorting phase, I used my findings to create the first version of our new product, ATD360’s IA. Our new form of data presentation was named “Profiles” and this was just the beginning.

The New Flow: The Insights Inside Alarms

To optimize the use of our “Profiles” solution for data presentation and alarm relation extraction, I decided to create a new flow. Our goal was to provide our Persona with the insights they wanted without the need for manual searching between alarms. To achieve this, I went back to the interviews and listened to their voices once again. I reviewed how they worked and what insights they really wanted.

However, I didn’t feel confident with just the voices I had. As a product designer, you always want to be sure that you are on the right track. Unfortunately, there was no time to conduct new research. So, I tried to find patterns with the existing data and discovered some key insights.

Firstly, there was always a point that connected alarms to each other, such as the host, IP, file, or connection. Secondly, hosts were always important, and one of the most common tasks they performed was checking the source or destination hosts. Thirdly, following clues was the primary reason for back-and-forth searching. Lastly, applying commands and actions was the most challenging job, and each type of element behaved differently.

After analyzing the data, I began creating an algorithm that would make these connections. The algorithm was the answer I was looking for, and I believed that it was all I needed to create my new user flow.

The Algorithm: Is this how designers solve a problem?

Well, No!

To solve the problem of manual searching between alarms, I created an algorithm that would show the connection between them. At first, I was hesitant to present the algorithm to the super professional SOCs team, but I persevered despite facing issues with the back-end team.

My solution was based on a graph, where nodes represented hosts and edges represented the events that occurred to them. I chose a graph because I needed a connection-based diagram that was interactive and could present a scenario.
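
Here is a minimal sketch of that idea in Python, using networkx and made-up alarm data. The hosts, field names, and events are assumptions for illustration only, not the real ATD360 data model.

```python
import networkx as nx

# Toy alarms: each names a source host, a destination host, and an event.
# The hosts and events below are invented purely for illustration.
alarms = [
    {"src": "host-12",  "dst": "mail-srv", "event": "malicious file uploaded"},
    {"src": "mail-srv", "dst": "host-07",  "event": "file copied"},
    {"src": "host-07",  "dst": "dc-01",    "event": "suspicious login"},
]

# Hosts become nodes; the events connecting them become edges,
# so a chain of related alarms reads as a single attack flow.
graph = nx.DiGraph()
for alarm in alarms:
    graph.add_edge(alarm["src"], alarm["dst"], event=alarm["event"])

for src, dst, data in graph.edges(data=True):
    print(f"{src} -> {dst}: {data['event']}")
# host-12 -> mail-srv: malicious file uploaded
# mail-srv -> host-07: file copied
# host-07 -> dc-01: suspicious login
```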

The first version of the graph showcased a simple attack scenario based on the alarms we already had. After I conducted a heuristic study with the previous eight participants, it was evident that everyone loved it!

In the subsequent versions, we added other types of actions and connections, making the graph more complex. Eventually, we arrived at a graph that included profiles showing the grouping of key values based on alarm type, and the relation between alarms was fully visualized without the need for manual searching.

The usability test was the next step in our journey.

Wait! Profiles?

 

Yes! My first solution to get a better structure on key values in an alarm was based on card sort research. I brought profiles to nodes and edges, which led to a fluent experience in drilling down to the root cause of an alarm.

Before the test phase, I was excitedly thinking to myself, “This is exactly what they want!” For the first time, I felt confident with the solution I had come up with.

How Does it Work?

 

While we didn’t remove the manual data extraction from alarms, we introduced a new user flow called “Incidents.” This panel displayed a list of graphs created by our algorithm based on a set of alarms. By selecting a graph, all hosts involved in that set were displayed as nodes, and the actual events as edges. With just one look, it became clear how those sets of alarms were connected to each other.

From there, users could easily drill down and search for important key values. And the best part? Profiles were already there, customized for the specific alarm.

Does it Work?

 

We initiated the project with a cohort of 12 new participants, and subsequently conducted a usability test phase. The results demonstrated an impressive 92% task success rate and an 83% satisfaction level, which, although not ideal, served as a promising start for our novel platform. Subsequently, we created a specialized section of the platform exclusive to our loyal users and began monitoring their activity via Hotjar. Upon analysis of the obtained data, we made the following observations:

  1. While the primary solution was functional, many users still relied on manual drill-down features and alarms lists in numerous scenarios.
  2. Our profiles were limited to a single-level categorization, whereas users required greater depth and specificity, such as searching in other hosts or mail servers.
  3. File transfers to antiviruses and subsequent results retrieval continued to pose a challenge.
  4. During attack scenario testing, Sandbox was employed as a separate platform, rather than being integrated into the primary one.

Next phase

Our primary algorithm and solution were embraced by the users who expressed a desire for additional features. This presented a crucial juncture in our journey, as we required a new design team to develop the features that our data indicated were necessary for scaling. We proceeded to onboard two new designers, though initially, their abilities could not be fully utilized. It was necessary to acclimate them to our domain and ensure a seamless integration into the team.

While the designers focused on improving usability KPIs and maintaining the current solution, I began formulating research plans for future endeavors. Among my priorities was establishing a streamlined flow between our FAM and Sandbox products. Additionally, I sought to implement a new search feature in our graph view to enhance functionality.

FAM & Sandbox: What are they solving?

 

In order to understand why users were utilizing our product and how they were doing so, we conducted a contextual inquiry. The findings were straightforward on an abstract level: Antiviruses! Our FAM product provided users with access to 13 different antiviruses that could scan a file and generate a comprehensive report on the infection rate and level of confidence regarding the likelihood of the file being infected.

However, the effectiveness of FAM was not infallible, as new strains of malware and trojans were emerging that could evade antivirus scans. To address this concern, we developed the Sandbox product. Sandbox provided users with a safe, customizable virtual environment in which they could run a file and obtain a detailed report of the behavior exhibited during file execution, along with any enabled indicators.

Despite the efficacy of both products, many users were compelled to leave the graph view in order to upload files to either FAM or Sandbox, causing a disruption in the user experience.

Embedding Sandbox & FAM to The New Graph

 

To address this issue, I decided to integrate the FAM and Sandbox results alongside every file, process, and email in the attack scenario, rather than requiring users to navigate to separate sections. By repeating the Profile solution, which had previously been utilized for hosts and key values, for all individual elements in the attack scenario, I established a pattern that users could recognize and seek out for structured information.

To display the FAM results, I created a new section within the Profiles that showed all antivirus results without requiring any action from the user. This made the FAM results visible and easily accessible when viewing the file profile.

However, incorporating Sandbox results was more challenging. FAM queries were lightweight and could be processed in seconds, but Sandbox was a time-consuming process that could significantly impact the optimization of the graph view feature. As a result, I designed the Sandbox section in a way that required users to actively request the results, with the understanding that the process would not be instantaneous.

They Were Still Getting Out of the Graph View Page😭

 

Despite the addition of FAM and Sandbox to the profiles, there were still scenarios that required a manual search, as the graph view was limited in its functionality. This was due to the fact that while the new graph view displayed connections, it did not provide actionable steps for users to take. For example, if a malicious file was discovered in the graph, SOC analysts may want to know which other hosts also had the file and which of those hosts had executed it. Additionally, if they wanted to remove the file from all hosts on the network, they were left without guidance on how to do so.

It was clear that one crucial piece of the puzzle was still missing…

Profiles v2

Initially, I had anticipated that the Profiles feature would be representative rather than interactive. However, it became apparent that we needed to incorporate the main actions that Security Operations Centers (SOCs) were already taking through query writing. To this end, I collaborated with our back-end team to source the relevant data. I posed a question to them about the most frequently used queries emanating from the alarm list.

Following their response, I presented the queries to our Cyber Security Manager for sorting, after which I conducted a brief Closed Card-Sorting research to categorize them according to the existing profiles. The outcome was a comprehensive set of actions aligned to each profile type, which proved to be a significant achievement considering the simplicity and swiftness of the research process.

Profiles v2, therefore, offered a novel approach to displaying essential values while facilitating action based on the data type. For instance, with the new feature, users could readily remove a malicious file by simply clicking a button, without the need to write a query specifying the file and the host.
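
As a rough, hypothetical sketch, the mapping from profile types to one-click actions might look something like the following. The action names below are my own placeholders, not the exact set produced by the card-sorting study.

```python
# Hypothetical mapping of profile types to the one-click actions in Profiles v2.
# Action names are placeholders; the real set came from the card-sorting study.
profile_actions = {
    "file":       ["remove from host", "send to FAM", "send to Sandbox"],
    "host":       ["isolate host", "list related alarms"],
    "connection": ["block IP", "show traffic history"],
}


def actions_for(profile_type: str) -> list[str]:
    """Return the one-click actions available for a given profile type."""
    return profile_actions.get(profile_type, [])


print(actions_for("file"))  # ['remove from host', 'send to FAM', 'send to Sandbox']
```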

However, a challenge arose when the team sought to remove the file from all the containing hosts.

 

Results: The New Panel

Our team learned from our initial vision that information presentation alone was not enough, and we needed to prioritize user interactions as well. One area that needed improvement was searching within our system, as SOCs were spending significant time manually crafting long search queries.

To address this, I reviewed our data and identified the most common search queries. I then incorporated these queries into the profiles section and created a new panel to display the results, aptly named “Results.” To keep all results in one place and prevent users from switching between multiple pages, I used a tab pattern.

In a subsequent usability test phase, we saw a significant decrease of 95% in page switching, indicating that our new design was more efficient for SOCs to use. However, we continued to monitor other KPIs to ensure a holistic improvement of our platform.

The Impact

Since the inception of the project, I have been meticulously curating a list of key performance indicators (KPIs) that required enhancement. I ensured that this list was continually updated in every stage of feature development. At present, I am pleased to present the final version of this list, along with the results from the latest usability test.

| Usability KPIs | Zero State (Before Design Team) | First Phase (Graph) | Second Phase (+The Profiles) | Third Phase (+Results) |
| --- | --- | --- | --- | --- |
| AVG Task success rate | 90% (manual alarm analysis) | 82% | 90% | 90% |
| AVG Error rate | 5% | 8% | 3% | 3% |
| Page Switch | 100% | 80% | 65% | 30% |
| User satisfaction | 25% | 60% | 60% | 75% |
| Attack Scenario Detection | 80% | 75% | 80% | 85% |
| Manual Alarm Analysis | 100% | 80% | 65% | 40% |
| Support Call per Attack (MDR) | 75% | 80% | 75% | 60% |
| FAM API Call | 0% | 0% | 100% | 100% |
| FAM Manual Access | 100% | 100% | 20% | 20% |

 

Never Enough! Never! Never!

As we continued to develop new features for Graph ATD360, we encountered new challenges that needed to be addressed. For example, there were certain queries that required more time to generate results, and there were certain actions that our users wanted to take automatically. To tackle these issues, we developed three new features: Queueing, Auto Response, and Business Rules. These features were designed to improve the user experience by streamlining certain tasks and automating certain actions, while still ensuring that our platform remained efficient and effective.

Issues and Barriers I faced: Was it that simple?

While building the minimal version of the new ATD360 solution, I had to work through several challenges.

  1. Firstly, I had to convince the development team that an automatic version of data extraction was feasible. This was not an easy task, as it required a significant shift in their mindset and approach.
  2. Additionally, gaining access to SOCs proved to be another challenge. Security analysts are a demanding persona to work with, as their tasks are varied and their goals frequently change throughout the day. As a designer, I had to be flexible and adapt to their ever-changing needs.
  3. Uncertainty was also a significant challenge throughout the project. At no point could I be certain that I was developing the right solution, and this made the process both enjoyable and frustrating.
  4. Moreover, I had to learn a lot about the security domain. Reading and understanding security-related material was a difficult yet rewarding experience.
  5. Lastly, I had to manage another designer who was maintaining the current ATD product while simultaneously adding features to the new solution. This required effective communication and collaboration to ensure that both projects were progressing smoothly.