Journal of Classification OnlineFirst articles

30.05.2024 | Original Research

Finding Outliers in Gaussian Model-based Clustering

Clustering, or unsupervised classification, is a task often plagued by outliers. Yet there is a paucity of work on handling outliers in clustering. Outlier identification algorithms tend to fall into three broad categories: outlier inclusion …

verfasst von:: Katharine M. Clark, Paul D. McNicholas

17.05.2024

SNN-PDM: An Improved Probability Density Machine Algorithm Based on Shared Nearest Neighbors Clustering Technique

Probability density machine (PDM) is a novel algorithm which was proposed recently for addressing class imbalance learning (CIL) problem. PDM can capture priori data distribution information well and present robust performance in various CIL …

verfasst von:: Shiqi Wu, Hualong Yu, Yan Gu, Changbin Shao, Shang Gao

Open Access 11.05.2024

A Novel Classification Algorithm Based on the Synergy Between Dynamic Clustering with Adaptive Distances and K-Nearest Neighbors

This paper introduces a novel supervised classification method based on dynamic clustering (DC) and K-nearest neighbor (KNN) learning algorithms, denoted DC-KNN. The aim is to improve the accuracy of a classifier by using a DC method to discover …

verfasst von:: Mohammed Sabri, Rosanna Verde, Antonio Balzanella, Fabrizio Maturo, Hamid Tairi, Ali Yahyaouy, Jamal Riffi

PDF-Version jetzt herunterladen Zum Volltext

09.05.2024 | Original Research

Accelerated Sequential Data Clustering

Data clustering is an important task in the field of data mining. In many real applications, clustering algorithms must consider the order of data, resulting in the problem of clustering sequential data. For instance, analyzing the moving pattern …

verfasst von:: Reza Mortazavi, Elham Enayati, Abdolali Basiri

06.05.2024 | Special Issue: IFCS 2022

Skew Multiple Scaled Mixtures of Normal Distributions with Flexible Tail Behavior and Their Application to Clustering

The family of multiple scaled mixtures of multivariate normal (MSMN) distributions has been shown to be a powerful tool for modeling data that allow different marginal amounts of tail weight. An extension of the MSMN distribution is proposed …

verfasst von:: Abbas Mahdavi, Anthony F. Desmond, Ahad Jamalizadeh, Tsung-I Lin

Open Access 04.03.2024

Inferential Tools for Assessing Dependence Across Response Categories in Multinomial Models with Discrete Random Effects

We propose a discrete random effects multinomial regression model to deal with estimation and inference issues in the case of categorical and hierarchical data. Random effects are assumed to follow a discrete distribution with an a priori unknown …

verfasst von:: Chiara Masci, Francesca Ieva, Anna Maria Paganoni

PDF-Version jetzt herunterladen Zum Volltext

04.03.2024

Binary Peacock Algorithm: A Novel Metaheuristic Approach for Feature Selection

Binary metaheuristic algorithms prove to be invaluable for solving binary optimization problems. This paper proposes a binary variant of the peacock algorithm (PA) for feature selection. PA, a recent metaheuristic algorithm, is built upon lekking …

verfasst von:: Hema Banati, Richa Sharma, Asha Yadav

Open Access 19.02.2024 | Erratum

Erratum to: Variable Selection for Hidden Markov Models with Continuous Variables and Missing Data

verfasst von:: Fulvia Pennoni, Francesco Bartolucci, Silvia Pandolfi

PDF-Version jetzt herunterladen Zum Volltext

Open Access 23.01.2024

Variable Selection for Hidden Markov Models with Continuous Variables and Missing Data

We propose a variable selection method for multivariate hidden Markov models with continuous responses that are partially or completely missing at a given time occasion. Through this procedure, we achieve a dimensionality reduction by selecting …

verfasst von:: Fulvia Pennoni, Francesco Bartolucci, Silvia Pandolfi

PDF-Version jetzt herunterladen Zum Volltext

08.01.2024

Parsimonious Seemingly Unrelated Contaminated Normal Cluster-Weighted Models

Normal cluster-weighted models constitute a modern approach to linear regression which simultaneously perform model-based cluster analysis and multivariate linear regression analysis with random quantitative regressors. Robustified models have …

verfasst von:: Gabriele Perrone, Gabriele Soffritti

07.12.2023

funLOCI: A Local Clustering Algorithm for Functional Data

Nowadays, an increasing number of problems involve data with one infinite continuous dimension known as functional data. In this paper, we introduce the funLOCI algorithm, which enables the identification of functional local clusters or functional …

verfasst von:: Jacopo Di Iorio, Simone Vantini

31.10.2023

Missing Values and Directional Outlier Detection in Model-Based Clustering

Model-based clustering tackles the task of uncovering heterogeneity in a data set to extract valuable insights. Given the common presence of outliers in practice, robust methods for model-based clustering have been proposed. However, the use of …

verfasst von:: Hung Tong, Cristina Tortora

18.08.2023 | Special Issue: IFCS 2022

Two Simple but Efficient Algorithms to Recognize Robinson Dissimilarities

A dissimilarity d on a set S of size n is said to be Robinson if its matrix can be symmetrically permuted so that its elements do not decrease when moving away from the main diagonal along any row or column. Equivalently, S admits a total order < …

verfasst von:: M. Carmona, V. Chepoi, G. Naves, P. Préa

12.06.2023 | Special Issue: IFCS 2022

Matrix-Variate Hidden Markov Regression Models: Fixed and Random Covariates

Two families of matrix-variate hidden Markov regression models (MV-HMRMs) are here introduced. The distinction between them relies on the role of the covariates, which can be treated as fixed or random. Parsimony is achieved by using the …

verfasst von:: Salvatore D. Tomarchio, Antonio Punzo, Antonello Maruotti

Springer Professional