Hi, I'm Edward. I'm currently interested in developing data-driven methods that interact with, explore, and learn from the world. My research spans deep reinforcement learning, perception, and robotics.
Here are the codebases of my research projects so far.
scaffolder | Privileged Sensing Scaffolds RL (ICLR'24 Spotlight)
planning goals for exploration | Planning Goals for Exploration (ICLR'23 Spotlight)
interactive reward functions | Training Robots to Evaluate Robots (CoRL'22 Best Paper Award)
robot aware control | Know Thyself: Transferable Visual Control Policies Through Robot-Awareness (ICLR'22)
ikea furniture simulation | IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks (ICRA'21)
I like studying codebases that are elegantly written and do cool things. Some topics I've found interesting lately: dataloading at scale, a JAX renderer, and a JAX Monte Carlo Tree Search library.
Some of my side projects:
gpt2 slackbot | Chatting with GPT-2 in Slack
optical illusion | A cool optical illusion
Something to think about as an ML researcher
I think researchers start out pure-hearted, but can easily end up misled and lost. The incentives of the modern research community, particularly in ML, are misaligned with doing good science. To use an analogy, ML research today is like a hackathon. You are incentivized to put together an MVP that works just well enough to pass the judges' appraisal. You feel obligated to use the shiny new "X" because it will garner public attention. Companies with free t-shirts and kickbacks swarm around you. Yes, some of these things are unavoidable. But if you blindly follow the noise, you may end up in the eye of the storm: at a standstill, with no exit in sight.
Don't be distracted by the noise!