Skip to main navigation Skip to search Skip to main content

Combining Word Embeddings with Fuzzy Logic to Protect Web Applications Fuzzy VADAS

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In the early era of the Internet, webpages only contained static content, such as text and images. However, with the emergence of Web 2.0, a new set of dynamic web applications appeared, such as online banking, e-commerce, social networking, gaming, and others that revolutionized the industry. These new technologies have presented a new set of vulnerabilities that can be exploited by malicious users for multiple purposes, such as data exfiltration/modification/deletion, privilege escalation, malware installation, DDoS (Distributed Denial of Service) attacks, etc. In order to detect some of these web attacks, companies are relying in Web Application Firewalls. These Web Application Firewalls (WAFs) rely on complicated regular expressions (REGEX) that are created by experienced security researchers in order to detect malicious signatures found in tampered HTTP requests. The goal of this research is to provide an alternate way to detect these web attacks without relying on complicated regular expressions. VADAS (Valence Aware worD embedding for web Application Security) approach to detect web attacks is by using a set of revised vocabularies (word embeddings created using unsupervised algorithms) that are commonly found in web attack vectors. These embeddings will allow us to calculate a valence score for each word quantifying its positive/negative score by using cosine similarity. The output from VADAS system is connected to a fuzzy logic controller in order to achieve a final 'maliciousness' classification result (Fuzzy VADAS). Preliminary results show that the performance of Fuzzy VADAS is quite effective, obtaining an accuracy of over 98%. The proposed Fuzzy VADAS approach provides a new way of detecting web application attacks, relying on minimal interaction with security experts (refinement of dictionaries, removing good words, etc.). This is a great advantage in comparison to existing REGEX rule-based systems.

Original languageEnglish
Title of host publicationProceeding of the 2023 IEEE 41st Central America and Panama Convention, CONCAPAN XLI 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350380927
DOIs
StatePublished - 2023
Event41st IEEE Central America and Panama Convention, CONCAPAN 2023 - Tegucigalpa, Honduras
Duration: 8 Nov 202310 Nov 2023

Publication series

NameProceeding of the 2023 IEEE 41st Central America and Panama Convention, CONCAPAN XLI 2023

Conference

Conference41st IEEE Central America and Panama Convention, CONCAPAN 2023
Country/TerritoryHonduras
CityTegucigalpa
Period8/11/2310/11/23

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Artificial Intelligence
  • Fasttext
  • Fuzzy Logic
  • NLP
  • WAF
  • Web Application Firewalls
  • Web Application Security
  • Word Embeddings
  • n-gram

Fingerprint

Dive into the research topics of 'Combining Word Embeddings with Fuzzy Logic to Protect Web Applications Fuzzy VADAS'. Together they form a unique fingerprint.

Cite this