OpenAI News

Roboschool

OpenAI News

9 years 1 month ago

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

Equivalence between policy gradients and soft Q-learning

OpenAI News

9 years 2 months ago

Stochastic Neural Networks for hierarchical reinforcement learning

OpenAI News

9 years 2 months ago

Unsupervised sentiment neuron

OpenAI News

9 years 2 months ago

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

Spam detection in the physical world

OpenAI News

9 years 2 months ago

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

Evolution strategies as a scalable alternative to reinforcement learning

OpenAI News

9 years 3 months ago

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.

One-shot imitation learning

OpenAI News

9 years 3 months ago

Distill

OpenAI News

9 years 3 months ago

We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).

Learning to communicate

OpenAI News

9 years 3 months ago

In this post we’ll outline new OpenAI research in which agents develop their own language.

Emergence of grounded compositional language in multi-agent populations

OpenAI News

9 years 3 months ago

Prediction and control with temporal segment models

OpenAI News

9 years 3 months ago

Third-person imitation learning

OpenAI News

9 years 3 months ago

Attacking machine learning with adversarial examples

OpenAI News

9 years 4 months ago

Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult.

Adversarial attacks on neural network policies

OpenAI News

9 years 4 months ago

Team update

OpenAI News

9 years 4 months ago

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

OpenAI News

9 years 5 months ago

Faulty reward functions in the wild

OpenAI News

9 years 6 months ago

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

Universe

OpenAI News

9 years 6 months ago

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.