Skip to main content

Knowledge and Information Systems OnlineFirst articles

Open Access 24.05.2024 | Regular Paper

Tuning structure learning algorithms with out-of-sample and resampling strategies

One of the challenges practitioners face when applying structure learning algorithms to their data involves determining a set of hyperparameters; otherwise, a set of hyperparameter defaults is assumed. The optimal hyperparameter configuration …

verfasst von:
Kiattikun Chobtham, Anthony C. Constantinou

23.05.2024 | Regular Paper

A visual programming tool for mobile web augmentation

The use of mobile devices for web browsing has increased extraordinarily in recent years, becoming the main source of information. Unfortunately, developers cannot meet the needs of all users. As a result, users have been forced to adapt web …

verfasst von:
Iñigo Aldalur, Alain Perez, Felix Larrinaga, Miren Illarramendi

23.05.2024 | Regular Paper

Reasoning subevent relation over heterogeneous event graph

Subevent relation identification (SRI) is a challenging natural language processing task of great value for knowledge acquisition and reasoning. Given an event pair, previous work mainly defines SRI as a classification task and usually relies on …

verfasst von:
Ting-Ting Wu, Xiao Ding, Li Du, Bing Qin, Ting Liu

23.05.2024 | Regular Paper

CIIR: an approach to handle class imbalance using a novel feature selection technique

The increasing vulnerability of systems and the rise in malicious events have sparked concerns about network security. In order to address these threats, network intrusion detection systems (NIDSs) play a role in protecting against malicious …

verfasst von:
Bidyapati Thiyam, Shouvik Dey

23.05.2024 | Regular Paper

Multiple optimized ensemble learning for high-dimensional imbalanced credit scoring datasets

Credit scoring models are crucial tools for lenders to assess credit risks. Researchers from academia and the financial industry have shown intense interest in these models. However, real credit datasets often have high dimensionality and class …

verfasst von:
Sudhansu R. Lenka, Sukant Kishoro Bisoy, Rojalina Priyadarshini

Open Access 18.05.2024 | Regular Paper

Multi-agent system architecture for winter road maintenance: a real Spanish case study

Road safety remains a critical issue in contemporary society, where the sudden deterioration of road conditions due to weather-related natural phenomena poses significant risks. These abrupt changes can lead to severe safety hazards on the roads …

verfasst von:
Diego M. Jiménez-Bravo, Javier Bajo, Jacinto González-Pachón, Juan F. De Paz

18.05.2024 | Regular Paper

CRAS: cross-domain recommendation via aspect-level sentiment extraction

To address the problem of sparse data and cold-start when facing new users and items in the single-domain recommendation, cross-domain recommendation has gradually become a hot topic in the recommendation system. This method enhances target domain …

verfasst von:
Fan Zhang, Yaoyao Zhou, Pengfei Sun, Yi Xu, Wanjiang Han, Hongben Huang, Jinpeng Chen

17.05.2024 | Review

Fake review detection techniques, issues, and future research directions: a literature review

Recently, the impact of product or service reviews on customers' purchasing decisions has become increasingly significant in online businesses. Consequently, manipulating reviews for fame or profit has become prevalent, with some businesses …

verfasst von:
Ramadhani Ally Duma, Zhendong Niu, Ally S. Nyamawe, Jude Tchaye-Kondi, Nuru Jingili, Abdulganiyu Abdu Yusuf, Augustino Faustino Deve

16.05.2024 | Regular Paper

Improving Alzheimer’s classification using a modified Borda count voting method on dynamic ensemble classifiers

Alzheimer’s detection is a challenging task for physicians. There are subtle differences in the bio-marker characteristics of Alzheimers and mild cognitive impairment patients which is very difficult to detect by a physician. Machine learning …

verfasst von:
K. P. Muhammed Niyas, Thiyagarajan Paramasivan

16.05.2024 | Regular Paper

A new neighbourhood-based diffusion algorithm for personalized recommendation

Object ratings in recommendation algorithms are used to represent the extent to which a user likes an object. Most existing recommender systems use these ratings to recommend the top-K objects to a target user. To improve the accuracy and …

verfasst von:
Diyawu Mumin, Lei-Lei Shi, Lu Liu, Zi-xuan Han, Liang Jiang, Yan Wu

12.05.2024 | Regular Paper

Misclassification-guided loss under the weighted cross-entropy loss framework

As deep neural networks for visual recognition gain momentum, many studies have modified the loss function to improve the classification performance on long-tailed data. Typical and effective improvement strategies are to assign different weights …

verfasst von:
Yan-Xue Wu, Kai Du, Xian-Jie Wang, Fan Min

11.05.2024 | Regular Paper

C22MP: the marriage of catch22 and the matrix profile creates a fast, efficient and interpretable anomaly detector

Many time series data mining algorithms work by reasoning about the relationships the conserved shapes of subsequences. To facilitate this, the Matrix Profile is a data structure that annotates a time series by recording each subsequence’s …

verfasst von:
Sadaf Tafazoli, Yue Lu, Renjie Wu, Thirumalai Vinjamoor Akhil Srinivas, Hannah Dela Cruz, Ryan Mercer, Eamonn Keogh

11.05.2024 | Review

An analysis of large language models: their impact and potential applications

Large language models (LLMs) have transformed the interpretation and creation of human language in the rapidly developing field of computerized language processing. These models, which are based on deep learning techniques like transformer …

verfasst von:
G. Bharathi Mohan, R. Prasanna Kumar, P. Vishal Krishh, A. Keerthinathan, G. Lavanya, Meka Kavya Uma Meghana, Sheba Sulthana, Srinath Doss

10.05.2024 | Regular Paper

A comprehensive ensemble pruning framework based on dual-objective maximization trade-off

Ensemble learning has gotten a lot of interest because of its capacity to increase predictive accuracy by merging numerous models. However, redundant data and a high level of computing complexity frequently plague ensembles. To choose a subset of …

verfasst von:
Anitha Gopalakrishnan, J. Martin Leo Manickam

09.05.2024 | Review

Trends and challenges in sentiment summarization: a systematic review of aspect extraction techniques

Sentiment Summarization is an automated technology that extracts important features of sentences and then reorganizes selected words or sentences by their aspect class and sentiment polarity. This emerging research area wields considerable …

verfasst von:
Nur Hayatin, Suraya Alias, Lai Po Hung

08.05.2024 | Regular Paper

Relational multi-scale metric learning for few-shot knowledge graph completion

Few-shot knowledge graph completion (FKGC) refers to the task of inferring missing facts in a knowledge graph by utilizing a limited number of reference entities. Most FKGC methods assume a single similarity metric, which leads to a single feature …

verfasst von:
Yu Song, Mingyu Gui, Kunli Zhang, Zexi Xu, Dongming Dai, Dezhi Kong

Open Access 08.05.2024 | Brief Report

Big data in transportation: a systematic literature analysis and topic classification

This paper identifies trends in the application of big data in the transport sector and categorizes research work across scientific subfields. The systematic analysis considered literature published between 2012 and 2022. A total of 2671 studies …

verfasst von:
Danai Tzika-Kostopoulou, Eftihia Nathanail, Konstantinos Kokkinos

07.05.2024 | Regular Paper

CG-FHAUI: an efficient algorithm for simultaneously mining succinct pattern sets of frequent high average utility itemsets

The identification of both closed frequent high average utility itemsets (CFHAUIs) and generators of frequent high average utility itemsets (GFHAUIs) has substantial significance because they play an essential and concise role in representing …

verfasst von:
Hai Duong, Tin Truong, Bac Le, Philippe Fournier-Viger

30.04.2024 | Regular Paper

Enhancing sentiment analysis via fusion of multiple embeddings using attention encoder with LSTM

Different embeddings capture various linguistic aspects, such as syntactic, semantic, and contextual information. Taking into account the diverse linguistic facets, we propose a novel hybrid model. This model hinges on the amalgamation of multiple …

verfasst von:
Jitendra Soni, Kirti Mathur

Open Access 26.04.2024 | Regular Paper

Robustness verification of k-nearest neighbors by abstract interpretation

We study the certification of stability properties, such as robustness and individual fairness, of the k-nearest neighbor algorithm (kNN). Our approach leverages abstract interpretation, a well-established program analysis technique that has been …

verfasst von:
Nicolò Fassina, Francesco Ranzato, Marco Zanella