1. Quick Dopamine

Perceiver can consume any kind of data you give it; the examples added here demonstrate how powerful this approach can be. I have added code for the following:

  1. [link00] Image classification:

    Training a CIFAR10 model: the input image is flattened to a 2D array of shape [1024, 3], the latents have shape [32, 32], and classification over n_classes = 10 happens after average pooling across latent_len. This is how the config is set in gperc:

    from gperc import ImageConfig
    
    config = ImageConfig(
      image_shape=[32, 32, 3],  # CIFAR10 images are 32x32 RGB
      latent_len=32,
      latent_dim=32,
      n_classes=10,             # CIFAR10 has 10 classes
    )
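
    A minimal sketch of running this config, assuming the gperc.Perceiver model consumes the config directly and takes the flattened pixel array as input:

    import torch
    from gperc import Perceiver

    model = Perceiver(config)

    # a batch of 4 CIFAR10 images, each flattened from [32, 32, 3] to [1024, 3]
    x = torch.randn(4, 1024, 3)
    logits = model(x)  # expected shape [4, 10] after average pooling across latent_len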
    
  2. [link01] Masked Language Modeling:

    Training a BERT-style masked language model on a few articles; this happens using gperc:

    from gperc import TextConfig
    
    config = TextConfig(
      latent_dim=8,
      vocab_size=len(vocabulary),      # vocabulary built from the training articles
      max_len=tensor_target.shape[1],  # sequence length of the tokenised targets
      latent_frac=0.15,
    )
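
    To make the masking objective concrete, here is a generic MLM masking sketch in plain PyTorch (not gperc API); MASK_ID and the 15% ratio here are illustrative:

    import torch

    MASK_ID = 0                                  # hypothetical [MASK] token id
    tokens = torch.randint(1, 100, (2, 32))      # batch of 2 sequences, 32 token ids each

    # hide 15% of the positions; the model is trained to recover the original ids
    mask = torch.rand(tokens.shape) < 0.15
    inputs = tokens.masked_fill(mask, MASK_ID)
    labels = tokens.masked_fill(~mask, -100)     # -100 is ignored by nn.CrossEntropyLoss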
    
  3. [link02] Image Segmentation:

    Training a simple segmentation network on an image segmentation dataset. In gperc you define the configuration as follows:

    from gperc import ImageConfig
    
    config = ImageConfig(
      image_shape=(224, 224, 3),
      latent_len=32,
      latent_dim=32,
      num_layers=6,
      n_classes=2,
      task="segmentation",
    )
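
    A sketch of the expected shapes, assuming that with task="segmentation" the model emits one class score per input position:

    import torch
    from gperc import Perceiver

    model = Perceiver(config)

    # one 224x224 RGB image flattened to [50176, 3]
    x = torch.randn(1, 224 * 224, 3)
    out = model(x)  # per-pixel logits, e.g. [1, 50176, 2] for the 2 classes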
    
  4. [link03] Grokking:

    Grokking is the phenomenon of a model suddenly snapping into place and learning the underlying rule as you keep training, described in this paper.

    from gperc import TextConfig
    
    config = TextConfig(
      latent_dim=8,
      vocab_size=len(vocabulary),
      max_len=tensor_target.shape[1],
      latent_frac=0.15,
    )
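
    The paper trains on small algorithmic datasets; a minimal sketch of one such task, modular addition, where the vocabulary is the p residues plus any separator tokens:

    import torch

    p = 97  # prime modulus used in the paper's experiments
    pairs = [(a, b) for a in range(p) for b in range(p)]

    # each example is the pair [a, b] with target (a + b) % p
    inputs = torch.tensor(pairs)                            # shape [p * p, 2]
    targets = torch.tensor([(a + b) % p for a, b in pairs])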
    
  5. [link04] Reinforcement Learning:

    Train a simple model to solve the CartPole gym environment.
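
    For reference, a random-policy rollout of the environment using the classic gym API (newer gymnasium releases return slightly different tuples from reset and step):

    import gym

    env = gym.make("CartPole-v1")
    obs = env.reset()
    done = False
    while not done:
        action = env.action_space.sample()  # replace with the trained model's policy
        obs, reward, done, info = env.step(action)
    env.close()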

  6. [link05] Audio Classification:

    Training an audio classification network on the GTZAN dataset. The audio config in gperc can be defined as follows:

    from gperc import AudioConfig
    
    config = AudioConfig(
      sample_rate=22050,   # GTZAN audio is sampled at 22050 Hz
      duration=30,         # each clip is 30 seconds long
      hop_length=512,      # samples between successive MFCC frames
      num_mfcc=13,         # MFCC coefficients per frame
      num_segments=10,     # split each clip into 10 segments
      num_channels=1,
      latent_len=32,
      latent_dim=32,
      num_layers=4,
      n_classes=10,        # GTZAN has 10 genres
    )
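
    To see what these numbers mean on real audio, here is how the MFCC features could be computed with librosa (not gperc API); GTZAN clips are 30 s at 22050 Hz, so num_segments=10 gives ten 3-second segments:

    import librosa

    y, sr = librosa.load("blues.00000.wav", sr=22050)  # hypothetical GTZAN file

    segment = y[: len(y) // 10]  # first of the 10 segments, ~3 seconds of audio

    # 13 MFCC coefficients per frame, frames spaced 512 samples apart
    mfcc = librosa.feature.mfcc(y=segment, sr=sr, n_mfcc=13, hop_length=512)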
    
  7. [link06] Transfer Learning: (WIP)

    The real power of transformer encoder models comes from unsupervised pretraining and transferring it to a downstream classification task, which helps build really powerful models. I want this functionality built into gperc directly. For this I have added a custom, built-from-scratch gperc.Consumer object that manages your data. (WIP) Finetuning also requires changing the architecture a little; this is also being added as a first-class feature of the gperc.Perceiver object.
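
    Until that lands, the usual pattern can be sketched in plain PyTorch: pretrain the encoder, then swap in a fresh head and finetune (the names here are illustrative, not gperc API):

    import torch.nn as nn

    class FineTuned(nn.Module):
        """Wrap a pretrained encoder with a new classification head."""

        def __init__(self, encoder, latent_dim, n_classes):
            super().__init__()
            self.encoder = encoder                # pretrained, optionally frozen
            self.head = nn.Linear(latent_dim, n_classes)

        def forward(self, x):
            z = self.encoder(x)                   # [batch, latent_len, latent_dim]
            return self.head(z.mean(dim=1))       # pool latents, then classify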

More 🍰 on the way