2048-expectimax

Simulating an AI playing 2048 using the Expectimax algorithm

The base game engine uses code from here.

The AI player is modeled as the max player, and the computer as the chance player (it picks a random open spot to place a 2-tile). The score returned by the game engine is used as the evaluation function value at the leaf nodes of the tree.
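As a rough illustration of the chance player, the sketch below enumerates every board the computer could produce by placing a 2-tile, each with equal probability. The board representation (a list of lists with 0 for an empty cell) and the name `chance_outcomes` are assumptions made for this example; they are not taken from 'game.py'.

```python
from typing import List, Tuple

# Assumption for this sketch: a board is a list of rows, and 0 marks an empty cell.
Board = List[List[int]]


def chance_outcomes(board: Board) -> List[Tuple[float, Board]]:
    """Return (probability, board) pairs, one per open cell that could receive a 2-tile."""
    open_cells = [
        (r, c)
        for r, row in enumerate(board)
        for c, value in enumerate(row)
        if value == 0
    ]
    outcomes = []
    for r, c in open_cells:
        new_board = [row[:] for row in board]  # copy so the input board is untouched
        new_board[r][c] = 2
        # Each open spot is equally likely, so the probability is 1 / number of open cells.
        outcomes.append((1.0 / len(open_cells), new_board))
    return outcomes
```

The value of a chance node is then the probability-weighted average of the values of these boards. A full board has no open cells, so the function returns an empty list in that case.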

You can play the game manually using the arrow keys. Pressing 'Enter' will let the AI play, and pressing 'Enter' again will stop the AI player. Read the game engine code in 'game.py' to see how it returns the game state and evaluates the score of an arbitrary game state after an arbitrary player move.

A depth-3 game tree means the tree should have the following levels:

root: player
level 1: computer
level 2: player
level 3: terminal with payoff (note that we say "terminal" to mean the leaf nodes in the shallow game tree, not the termination of the game itself)

This tree represents all the game states of a player-computer-player sequence from the current state: the player makes a move, the computer places a tile, the player makes another move, and then the score is evaluated.
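The depth-3 tree described above can be sketched as a short recursion. This is a minimal illustration, not the repository's implementation: the three callbacks (`player_moves`, `chance_outcomes`, `evaluate`) are hypothetical stand-ins for whatever the game engine provides, and the toy game at the bottom uses plain integers as states only to show the call shape.

```python
def expectimax(state, depth, is_max, player_moves, chance_outcomes, evaluate):
    """Return (value, best_move) for `state`, searching `depth` more levels.

    player_moves(state)    -> list of (move, next_state)        (hypothetical callback)
    chance_outcomes(state) -> list of (probability, next_state) (hypothetical callback)
    evaluate(state)        -> numeric score used at the leaves  (hypothetical callback)
    """
    if depth == 0:
        # Leaf of the shallow tree: score the state, whoever's turn it would be.
        return evaluate(state), None

    if is_max:
        children = player_moves(state)
        if not children:  # no legal move, so treat the node as a leaf
            return evaluate(state), None
        best_value, best_move = float("-inf"), None
        for move, next_state in children:
            value, _ = expectimax(next_state, depth - 1, False,
                                  player_moves, chance_outcomes, evaluate)
            if value > best_value:
                best_value, best_move = value, move
        return best_value, best_move

    outcomes = chance_outcomes(state)
    if not outcomes:  # nowhere to place a tile, so treat the node as a leaf
        return evaluate(state), None
    # Chance node: probability-weighted average over the computer's placements.
    expected = sum(
        prob * expectimax(next_state, depth - 1, True,
                          player_moves, chance_outcomes, evaluate)[0]
        for prob, next_state in outcomes
    )
    return expected, None


# Toy game (purely illustrative): the state is an integer and the score is its value.
def toy_moves(s):
    return [("inc", s + 1), ("dbl", s * 2)]


def toy_chance(s):
    return [(0.5, s), (0.5, s + 2)]


value, move = expectimax(3, 3, True, toy_moves, toy_chance, float)
```

Calling the function with `depth=3` and `is_max=True` walks exactly the levels listed above: a max node at the root, chance nodes at level 1, max nodes at level 2, and evaluated leaves at level 3.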

Usage

To run the program:

    python main.py

Once the program is running, the following keyboard options are available in-game:

'r': restart the game
'u': undo a move
'3'-'7': change board size
'g': toggle grayscale

Owner

Subha Ramesh