Pseudoinvertible Neural Networks

Elijah D. Bolluyt, Cristina Comaniciu

Research output: Contribution to journalArticlepeer-review

Abstract

This work introduces novel architecture components and training procedures to create augmented neural networks with the ability to process data bidirectionally via an end-to-end approximate inverse. We develop pseudoinvertible neural network (PsI-NN) layers which function as drop-in replacements for corresponding convolutional and fully connected layers; by using these, existing architectures gain a pseudoinverse function which, with training, approximately reverses the forward function. For cases where learning both a task and its inverse are necessary or desirable, we show that PsI-NN enabled models match or exceed the quality of results generated by systems that use two separate models while drastically reducing system parameter count. We demonstrate this on two tasks: unpaired image translation and semisupervised classification. In both, PsI-NN greatly reduces parameter count without any loss of output quality; in semisupervised image classification, PsI-NN improves classification performance beyond the baseline autoencoder method. Our approach creates pseudoinvertible versions of existing architectures, circumventing the stringent constraints required to create a true inverse while allowing a single neural network to learn a task in two directions simultaneously.

Original languageEnglish
Pages (from-to)602-612
Number of pages11
JournalIEEE Transactions on Artificial Intelligence
Volume5
Issue number2
DOIs
StatePublished - 1 Feb 2024

Keywords

  • Semisupervised learning
  • convolutional neural networks (CNN)
  • unsupervised learning

Fingerprint

Dive into the research topics of 'Pseudoinvertible Neural Networks'. Together they form a unique fingerprint.

Cite this