For code samples, please take a look at the files from the to-be-published AI-Memory research above.

I replicated DeepMind's Multimodal Few-Shot Learning with Frozen Language Models.

I implemented it using GPT-J as the language model and CLIP's Vision Transformer as the visual encoder, for a total of approximately 6.1 billion parameters. Using pre-trained models and preemptible TPU v3-8 instances brought the training cost down to a couple of thousand dollars.
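A minimal sketch of the Frozen-style setup, using numpy stand-ins for the real CLIP and GPT-J components (all names and dimensions here are illustrative assumptions, not the actual implementation): the only trained piece is a small projection that maps image features into a short sequence of "visual prefix" embeddings, which are concatenated in front of the prompt's token embeddings before the frozen language model attends over the whole sequence.

```python
import numpy as np

rng = np.random.default_rng(0)

D_VISION = 512    # CLIP ViT feature size (illustrative)
D_LM = 4096       # GPT-J embedding size (illustrative)
N_PREFIX = 2      # number of visual "prefix tokens"

# The only trained component: a linear map from image features
# into N_PREFIX embeddings in the frozen LM's input space.
W_proj = rng.normal(0, 0.02, size=(D_VISION, N_PREFIX * D_LM))

def visual_prefix(image_features: np.ndarray) -> np.ndarray:
    """Map a (D_VISION,) image feature vector to (N_PREFIX, D_LM) prefix embeddings."""
    return (image_features @ W_proj).reshape(N_PREFIX, D_LM)

def lm_inputs(image_features: np.ndarray, token_embeddings: np.ndarray) -> np.ndarray:
    """Concatenate the visual prefix with the prompt's token embeddings;
    the frozen LM then attends over the combined sequence."""
    return np.concatenate([visual_prefix(image_features), token_embeddings], axis=0)

img = rng.normal(size=(D_VISION,))       # stand-in for CLIP image features
prompt = rng.normal(size=(7, D_LM))      # stand-in for 7 prompt token embeddings
seq = lm_inputs(img, prompt)
print(seq.shape)  # (9, 4096): 2 visual tokens + 7 text tokens
```

Because the language model's weights stay frozen, only `W_proj` (and, in the real system, the visual encoder's adapter) receives gradients, which is what keeps the training bill small.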

Input Image


Input Prompt

```
Question 1: In the picture, what color jerseys are the players wearing?
Answer: Green tshirts and black shorts.
Question 2: What are the players doing?
Answer:
```

Output Text

```
Question 1: In the picture, what color jerseys are the players wearing?
Answer: Green tshirts and black shorts.
Question 2: What are the players doing?
Answer: They're warming up for a game
```

I designed a novel deep generative 3D-CNN–Transformer hybrid architecture to build a computationally tractable global climate forecasting engine.
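The hybrid's division of labor can be sketched with toy numpy stand-ins (the shapes, kernel, and single attention head below are illustrative assumptions, not the actual architecture): a 3D convolution compresses local spatiotemporal structure in a (time, lat, lon) field, and a transformer-style self-attention layer then models long-range interactions across the compressed representation.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv3d_valid(x: np.ndarray, k: np.ndarray) -> np.ndarray:
    """Naive valid-mode 3D convolution: x is (T, H, W), k is (kt, kh, kw)."""
    kt, kh, kw = k.shape
    T, H, W = x.shape
    out = np.empty((T - kt + 1, H - kh + 1, W - kw + 1))
    for t in range(out.shape[0]):
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                out[t, i, j] = np.sum(x[t:t+kt, i:i+kh, j:j+kw] * k)
    return out

def self_attention(x: np.ndarray) -> np.ndarray:
    """Single-head self-attention over an (n_tokens, d) sequence (projection
    weights omitted for brevity)."""
    d = x.shape[1]
    scores = x @ x.T / np.sqrt(d)
    scores -= scores.max(axis=1, keepdims=True)   # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)
    return attn @ x

# Toy climate field: 8 time steps on a 16x16 lat-lon grid.
field = rng.normal(size=(8, 16, 16))
kernel = rng.normal(size=(3, 3, 3))

# Stage 1: the 3D CNN extracts local spatiotemporal features.
local = conv3d_valid(field, kernel)          # (6, 14, 14)

# Stage 2: each time slice becomes one token; attention captures
# global (long-range) dependencies the convolution cannot reach.
tokens = local.reshape(local.shape[0], -1)   # (6, 196)
global_ctx = self_attention(tokens)
print(global_ctx.shape)  # (6, 196)
```

The tractability argument is in the shapes: the convolution shrinks the grid before attention runs, so the quadratic attention cost applies to a short token sequence rather than to every grid cell.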

I proposed a novel convolutional architecture that improves upon traditional CNNs by building on Jeff Hawkins's brilliant Thousand Brains Theory of intelligence.

I presented a soft proof of why convolutions can be used as a foundation for cortical mini-columns. This raises the question of whether the success of CNNs can be taken as weak empirical evidence for the Thousand Brains Theory.

AIM, which stands for AI Memory, is a novel method that attaches a memory to boost the performance of almost any neural network.

It takes only a few additional lines of code to implement and significantly improves a model's ability to learn and generalize. The proof of concept demonstrated that AIM has the potential to improve the performance of virtually any neural network.
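Since the AIM research is not yet published, the following is only a generic illustration of the kind of mechanism the description suggests, with every name and design choice a hypothetical assumption: a key-value memory that stores past hidden states and, at read time, blends a similarity-weighted retrieval back into the current hidden state in a few lines of wrapper code.

```python
import numpy as np

class ToyMemory:
    """Illustrative key-value memory: stores (key, value) activation pairs
    and blends a retrieved value into the model's current hidden state."""

    def __init__(self):
        self.keys, self.values = [], []

    def write(self, key: np.ndarray, value: np.ndarray) -> None:
        self.keys.append(key)
        self.values.append(value)

    def read(self, query: np.ndarray, alpha: float = 0.5) -> np.ndarray:
        if not self.keys:
            return query                      # empty memory: pass state through
        K = np.stack(self.keys)
        V = np.stack(self.values)
        # Cosine similarity between the query and each stored key.
        sims = K @ query / (np.linalg.norm(K, axis=1) * np.linalg.norm(query) + 1e-8)
        weights = np.exp(sims - sims.max())
        weights /= weights.sum()
        retrieved = weights @ V               # soft lookup over stored values
        return (1 - alpha) * query + alpha * retrieved

rng = np.random.default_rng(0)
mem = ToyMemory()
for _ in range(4):
    h = rng.normal(size=8)
    mem.write(h, h)          # memorize past hidden states as they occur

q = rng.normal(size=8)
augmented = mem.read(q)      # hidden state enriched with retrieved memory
print(augmented.shape)  # (8,)
```

The "few additional lines" framing would correspond to wrapping an existing layer's forward pass with `mem.write(...)` and `mem.read(...)` calls, leaving the rest of the network untouched.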