Skip to main content

International Journal of Data Science and Analytics OnlineFirst articles

Open Access 20.05.2024 | Regular Paper

Online boxplot derived outlier detection

Outlier detection is a widely used technique for identifying anomalous or exceptional events across various contexts. It has proven to be valuable in applications like fault detection, fraud detection, and real-time monitoring systems. Detecting …

verfasst von:
Arefeh Mazarei, Ricardo Sousa, João Mendes-Moreira, Slavo Molchanov, Hugo Miguel Ferreira

Open Access 17.05.2024 | Regular Paper

Uncertainty-aware non-invasive patient–ventilator asynchrony detection using latent Gaussian mixture generative classifier with noisy label correction

Patient–ventilator asynchrony (PVA) refers to instances where a mechanical ventilator’s cycles are desynchronised from the patient’s breathing efforts, and may result in patient discomfort and potential ineffective ventilation. Typically, they are …

verfasst von:
Chenyang Wang, Ling Luo, Uwe Aickelin, David J. Berlowitz, Mark E. Howard

16.05.2024 | Regular Paper

Time series forecasting of wheat crop productivity in Egypt using deep learning techniques

Egypt’s agricultural sector plays a critical role in the country’s economy, with wheat cultivation being vital for ensuring food security. However, the challenges faced by wheat farming in Egypt, such as climate change, water scarcity, and pest …

verfasst von:
Amal Mahmoud, Ammar Mohammed, M. M. abdel wahab, A. A. Khalil

16.05.2024 | Regular Paper

Feature extraction for exoplanet detection

Detecting possible habitable planets outside of our solar system has been a growing field of study. Among several other topics, this field aims to classify stars using the transit method, i.e., using their light intensity measured over time to …

verfasst von:
João Pimentel, Joana Amorim, Frank Rudzicz

Open Access 15.05.2024 | Regular Paper

Clustering source code from automated assessment of programming assignments

Clustering of source code is a technique that can help improve feedback in automated program assessment. Grouping code submissions that contain similar mistakes can, for instance, facilitate the identification of students’ difficulties to provide …

verfasst von:
José Carlos Paiva, José Paulo Leal, Álvaro Figueira

Open Access 14.05.2024 | Correction

Correction to: Alternative feature selection with user control

verfasst von:
Jakob Bach, Klemens Böhm

07.05.2024 | Regular Paper

Multi-language: ensemble learning-based speech emotion recognition

Inaccurate emotional reactions from robots have been a problem for authors in previous years. Since technology has advanced, robots like service robots can communicate with people of many other languages. The traditional Speech Emotion Recognition …

verfasst von:
Anumula Sruthi, Anumula Kalyan Kumar, Kishore Dasari, Yenugu Sivaramaiah, Garikapati Divya, Gunupudi Sai Chaitanya Kumar

Open Access 03.05.2024 | Regular Paper

Neural lasso: a unifying approach of lasso and neural networks

In recent years, there has been a growing interest in establishing bridges between statistics and neural networks. This article focuses on the adaptation of the widely used lasso algorithm within the context of neural networks. To accomplish this …

verfasst von:
Ernesto Curbelo, David Delgado-Gómez, Danae Carreras

30.04.2024 | Regular Paper

Swarm-based support vector machine optimization for protein sequence-encoded prediction

Protein is considered the important macronutrient for most of the biochemical activities of all living organisms. Many healthcare applications involve protein to protein interactions (PPIs) to predict diseases, DNA characters, and more. PPI …

verfasst von:
Prasanalakshmi Balaji, K. Srinivasan, R. Mahaveerakannan, Sudhanshu Maurya, T. Rajesh Kumar

29.04.2024 | Regular Paper

A unified approach for continuous sign language recognition and translation

Sign language recognition (SLR) is an emerging technology that shows potential in facilitating communication between the deaf and the hearing people. The sign language system employs a recognition module to generate glosses from videos of sign …

verfasst von:
Vaidehi Sharma, Abhay Kumar Gupta, Abhishek Sharma, Sandeep Saini

27.04.2024 | Regular Paper

Performance analysis of collaborative real-time video quality of service prediction with machine learning algorithms

With the exponential rise in the development, deployment and use of Internet applications in recent years, communication paradigms are continuously flooded with unmatched dimensions of real-time video traffic which amounted to 65.93% of the total …

verfasst von:
Lavesh Babooram, Tulsi Pawan Fowdur

25.04.2024 | Regular Paper

Online segmented thickness prediction of hot rolling strip based on IBA-XGBoost

An online segmented thickness prediction algorithm for steel strips based on machine learning is proposed to address issues of strong coupling and low accuracy in existing mathematical thickness models. Firstly, the rolling data are divided into …

verfasst von:
Fei Zhang, Shuo Huang, Li-jun Wang, Yong-jun Zhang, Yan-jiao Li, Xue-zhong Huang

Open Access 25.04.2024 | Review

An overview of sentence ordering task

The sentence ordering task aims to organize complex, unordered sentences into readable text. This improves accuracy, validity, and reliability in various natural language processing domains, including automatic text generation, text summarization …

verfasst von:
Yunmei Shi, Haiying Zhang, Ning Li, Teng Yang

22.04.2024 | Regular Paper

Enhancing author assessment: an advanced modified recursive elimination technique (MRET) for ranking key parameters and conducting statistical analysis of top-ranked parameter

Assessing the impact of authors in scientific research is crucial for evaluating scholarly contributions. Various parameters exist in the literature to quantify researchers’ productivity, such as publication count, citation count, and the h index.

verfasst von:
Ghulam Mustafa, Abid Rauf, Muhammad Tanvir Afzal

20.04.2024 | Regular Paper

Building the interpolating model for interval time series based on the fuzzy clustering technique

In the development of the social-economics of countries, time series is a data type stored commonly nowadays. For these data, forecasting has always received the attention of statisticians and managers because it brings to great advantages.

verfasst von:
Dan Nguyen-Thihong, Loc Tran-Phuoc, Tai Vo-Van

20.04.2024 | Review

Real-time anomaly detection in sky quality meter data using probabilistic exponential weighted moving average

Light pollution is a problem that impacts many elements of human life and the environment, including astronomical observations. The authors of this work offer a unique method for detecting anomalies in night sky brightness data recorded using a …

verfasst von:
Lala Septem Riza, Zulfikar Ali Yunara Putra, Muhammad Iqbal Zain, Fajar Zuliansyah Trihutama, Judhistira Aria Utama, Khyrina Airin Fariza Abu Samah, Dhani Herdiwijaya, Rinto Anugraha NQZ, Emanuel Sungging Mumpuni, Rhorom Priyatikanto

Open Access 16.04.2024 | Regular Paper

Enhancing the Vietoris–Rips simplicial complex for topological data analysis: applications in cancer gene expression datasets

The aim of this study is to enhance the extraction of informative features from complex data through the application of topological data analysis (TDA) using novel topological overlapping measures. Topological data analysis has emerged as a …

verfasst von:
Lebohang Mashatola, Zubayr Kader, Naaziyah Abdulla, Mandeep Kaur

15.04.2024 | Review

Predicting the pharmaceutical needs of hospitals using machine learning algorithms

People’s lives are always threatened by various diseases. The role of health and medical services, in particular medicine, is undeniable in protecting their lives. Timely preparation and providing medicine for patients is vital since medicine …

verfasst von:
Amir Hossein Nabizadeh, Mohammad Mehdi Ghaemi, Daniel Goncalves

14.04.2024 | Regular Paper

K-means DTW Barycenter Averaging: a clustering analysis of COVID-19 cases and deaths on the Brazilian federal units

A challenge faced while monitoring the COVID-19 pandemic in Brazil is the identification of patterns of incidence and mortality, which can help prioritize interventions to avoid excessive disease transmission and associated deaths. This study …

verfasst von:
Jonatas Silva do Espirito Santo, Jackson Santos da Conceição, Lilia Carolina Carneiro da Costa, Rosemeire Leovigildo Fiaccone, Marcos Ennes Barreto, Maria Yury Ichihara, Anderson Ara

13.04.2024 | Regular Paper

A common-specific feature cross-fusion attention mechanism for KGVQA

Knowledge graph-based visual question answering aims to utilize the information in the knowledge graph to assist in answering complex questions that are difficult to answer based on image features alone. However, using knowledge graphs increases …

verfasst von:
Mingyang Ma, Turdi Tohti, Askar Hamdulla