Multiple Additive Neural Networks for Structured and Unstructured Data

April 29, 20262604.26888

cs.LG

TLDR

MANN extends gradient boosting with shallow neural networks for structured and unstructured data, outperforming XGBoost and enhancing robustness.

Key contributions

Extends gradient boosting by using shallow neural networks (CNNs, Capsule NNs) as base learners.
Applies to structured and unstructured data, leveraging specific neural network architectures.
Incorporates advanced heuristics for continuous learning and robust overfitting prevention.
Achieves superior accuracy compared to traditional methods like XGBoost on diverse datasets.

Why it matters

This research introduces a versatile and robust machine learning methodology capable of handling diverse data types. Its superior precision and generalizability make it a significant advancement for complex learning environments. It offers a powerful alternative to traditional boosting methods.

Original Abstract

This paper extends and explains the Multiple Additive Neural Networks (MANN) methodology, an enhancement to the traditional Gradient Boosting framework, utilizing nearly shallow neural networks instead of decision trees as base learners. This innovative approach leverages neural network architectures, notably Convolutional Neural Networks (CNNs) and Capsule Neural Networks, to extend its application to both structured data and unstructured data such as images and audio. For structured data the advantages of capsule neural networks as feature extractors are used and combined with MANN as a classifier. MANN's unique architecture promotes continuous learning and integrates advanced heuristics to combat overfitting, ensuring robustness and reducing sensitivity to hyperparameter settings like learning rate and iterations. Our empirical studies reveal that MANN surpasses traditional methods such as Extreme Gradient Boosting (XGB) in accuracy across well-known datasets. This research demonstrates MANN's superior precision and generalizability, making it a versatile tool for diverse data types and complex learning environments.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers