rlcard-showdown/README.md

# RLCard Showdown
This is the GUI support for the [RLCard](https://github.com/datamllab/rlcard) project and DouZero project. RLCard-Showdown provides evaluation and visualization tools to help understand the performance of the agents. It includes a replay module, where you can analyze the replays, and a PvE module, where you can play with the AI interactively. Currently, we only support Leduc Hold'em and Dou Dizhu. The frontend is developed with [React](https://reactjs.org/). The backend is based on [Django](https://www.djangoproject.com/) and [Flask](https://flask.palletsprojects.com/). Have fun!

*   Official Website: [http://www.rlcard.org](http://www.rlcard.org)
*   Tutorial in Jupyter Notebook: [https://github.com/datamllab/rlcard-tutorial](https://github.com/datamllab/rlcard-tutorial)
*   Paper: [https://www.ijcai.org/Proceedings/2020/764](https://www.ijcai.org/Proceedings/2020/764)
*   Document: [Click Here](docs/README.md)

## Cite this work
Zha, Daochen, Kwei-Herng Lai, Songyi Huang, Yuanpu Cao, Keerthana Reddy, Juan Vargas, Alex Nguyen, Ruzhe Wei, Junyu Guo, and Xia Hu. "RLCard: A Platform for Reinforcement Learning in Card Games." In IJCAI. 2020.

```bibtex
@inproceedings{zha2020rlcard,
  title={RLCard: A Platform for Reinforcement Learning in Card Games},
  author={Zha, Daochen and Lai, Kwei-Herng and Huang, Songyi and Cao, Yuanpu and Reddy, Keerthana and Vargas, Juan and Nguyen, Alex and Wei, Ruzhe and Guo, Junyu and Hu, Xia},
  booktitle={IJCAI},
  year={2020}
}
```

## Installation
RLCard-Showdown has separated frontend and backend. The frontend is built with React and the backend is based on Django and Flask.

### Prerequisite
To set up the frontend, you should make sure you have [Node.js](https://nodejs.org/) and NPM installed. Normally you just need to manually install Node.js, and the NPM package would be automatically installed together with Node.js for you. Please refer to its official website for installation of Node.js.

You can run the following commands to verify the installation
```
node -v
npm -v
```
For backend, make sure that you have **Python 3.6+** and **pip** installed.

### Install Frontend and Backend
The frontend can be installed with the help of NPM:
```
git clone -b master --single-branch --depth=1 https://github.com/datamllab/rlcard-showdown.git
cd rlcard-showdown
npm install
```
The backend of leaderboard can be installed with
```
pip3 install -r requirements.txt
cd server
python3 manage.py migrate
cd ..
```

### Run RLCard-Showdown
1. Launch the backend of leaderboard with
```
cd server
python3 manage.py runserver
```
2. Download the pre-trained models in [Google Drive](https://drive.google.com/file/d/1zx-20xNBDbCFd8GWhZFUkl07lofbNHpy/view?usp=sharing) or [百度网盘](https://pan.baidu.com/s/12MgxVBBz4mgitT74quSWfw) 提取码: qh6s. Extract it in `pve_server/pretrained`.

In a new terminal, start the PvE server (i.e., human vs AI) of DouZero with
```
cd pve_server
python3 run_douzero.py
```
Alternatively, you can start the PvE server interfaced with RLCard:
```
cd pve_server
python3 run_dmc.py
```
They are conceptually the same with minor differences in state representation and training time of the pre-trained models (DouZero is fully trained with more than a month, while DMC in RLCard is only trained for hours).

3. Run the following command in another new terminal under the project folder to start frontend:
```
npm start
```
You can view leaderboard at [http://127.0.0.1:3000/](http://127.0.0.1:3000/) and PvE demo of Dou Dizhu at [http://127.0.0.1:3000/pve/doudizhu-demo](http://127.0.0.1:3000/pve/doudizhu-demo). The backend of leaderboard will run in [http://127.0.0.1:8000/](http://127.0.0.1:8000/). The PvE backend will run in [http://127.0.0.1:5000/](http://127.0.0.1:5000/).

## Demos
![leaderboards](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/leaderboards.png?raw=true)
![upload](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/upload.png?raw=true)
![doudizhu-replay](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/doudizhu-replay.png?raw=true)
![leduc-replay](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/leduc-replay.png?raw=true)

## Contact Us
If you have any questions or feedback, feel free to drop an email to [Songyi Huang](https://github.com/hsywhu) for the frontend or [Daochen Zha](https://github.com/daochenzha) for backend.

## Acknowledgements
We would like to thank JJ World Network Technology Co., LTD for the generous support, [Chieh-An Tsai](https://anntsai.myportfolio.com/) for user interface design, and [Lei Pan](https://github.com/lpan18) for the help in visualizations.
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00			`# RLCard Showdown`
Add RLCard PvE 2021-05-29 04:43:44 +08:00			This is the GUI support for the [RLCard](https://github.com/datamllab/rlcard) project and DouZero project. RLCard-Showdown provides evaluation and visualization tools to help understand the performance of the agents. It includes a replay module, where you can analyze the replays, and a PvE module, where you can play with the AI interactively. Currently, we only support Leduc Hold'em and Dou Dizhu. The frontend is developed with [React](https://reactjs.org/). The backend is based on [Django](https://www.djangoproject.com/) and [Flask](https://flask.palletsprojects.com/). Have fun!
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00
			`* Official Website: [http://www.rlcard.org](http://www.rlcard.org)`
			`* Tutorial in Jupyter Notebook: [https://github.com/datamllab/rlcard-tutorial](https://github.com/datamllab/rlcard-tutorial)`
			`* Paper: [https://www.ijcai.org/Proceedings/2020/764](https://www.ijcai.org/Proceedings/2020/764)`
Upload PvE 2021-05-29 00:25:02 +08:00			`* Document: [Click Here](docs/README.md)`
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00
			`## Cite this work`
Upload PvE 2021-05-29 00:25:02 +08:00			`Zha, Daochen, Kwei-Herng Lai, Songyi Huang, Yuanpu Cao, Keerthana Reddy, Juan Vargas, Alex Nguyen, Ruzhe Wei, Junyu Guo, and Xia Hu. "RLCard: A Platform for Reinforcement Learning in Card Games." In IJCAI. 2020.`

			```bibtex
			`@inproceedings{zha2020rlcard,`
			`title={RLCard: A Platform for Reinforcement Learning in Card Games},`
			`author={Zha, Daochen and Lai, Kwei-Herng and Huang, Songyi and Cao, Yuanpu and Reddy, Keerthana and Vargas, Juan and Nguyen, Alex and Wei, Ruzhe and Guo, Junyu and Hu, Xia},`
			`booktitle={IJCAI},`
			`year={2020}`
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00			`}`
			```

			`## Installation`
Upload PvE 2021-05-29 00:25:02 +08:00			`RLCard-Showdown has separated frontend and backend. The frontend is built with React and the backend is based on Django and Flask.`
add frontend readme 2020-12-05 05:24:31 +08:00
			`### Prerequisite`
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00			`To set up the frontend, you should make sure you have [Node.js](https://nodejs.org/) and NPM installed. Normally you just need to manually install Node.js, and the NPM package would be automatically installed together with Node.js for you. Please refer to its official website for installation of Node.js.`
add frontend readme 2020-12-05 05:24:31 +08:00
			`You can run the following commands to verify the installation`
			```
			`node -v`
			`npm -v`
			```
Add DQN 2021-05-28 14:15:41 +08:00			`For backend, make sure that you have Python 3.6+ and pip installed.`
add frontend readme 2020-12-05 05:24:31 +08:00
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00			`### Install Frontend and Backend`
			`The frontend can be installed with the help of NPM:`
add frontend readme 2020-12-05 05:24:31 +08:00			```
Upload PvE 2021-05-29 00:25:02 +08:00			`git clone -b master --single-branch --depth=1 https://github.com/datamllab/rlcard-showdown.git`
add frontend readme 2020-12-05 05:24:31 +08:00			`cd rlcard-showdown`
			`npm install`
			```
Add DQN 2021-05-28 14:15:41 +08:00			`The backend of leaderboard can be installed with`
Django init 2020-05-11 13:36:49 +08:00			```
Add DQN 2021-05-28 14:15:41 +08:00			`pip3 install -r requirements.txt`
Django init 2020-05-11 13:36:49 +08:00			`cd server`
Add DQN 2021-05-28 14:15:41 +08:00			`python3 manage.py migrate`
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00			`cd ..`
Django init 2020-05-11 13:36:49 +08:00			```
Fix delete agent bug and update README 2020-12-07 01:21:49 +08:00
			`### Run RLCard-Showdown`
Upload PvE 2021-05-29 00:25:02 +08:00			`1. Launch the backend of leaderboard with`
Django init 2020-05-11 13:36:49 +08:00			```
Update README.md 2020-12-07 01:27:48 +08:00			`cd server`
Add DQN 2021-05-28 14:15:41 +08:00			`python3 manage.py runserver`
Django init 2020-05-11 13:36:49 +08:00			```
Add RLCard PvE 2021-05-29 04:43:44 +08:00			2. Download the pre-trained models in [Google Drive](https://drive.google.com/file/d/1zx-20xNBDbCFd8GWhZFUkl07lofbNHpy/view?usp=sharing) or [百度网盘](https://pan.baidu.com/s/12MgxVBBz4mgitT74quSWfw) 提取码: qh6s. Extract it in `pve_server/pretrained`.

			`In a new terminal, start the PvE server (i.e., human vs AI) of DouZero with`
Upload PvE 2021-05-29 00:25:02 +08:00			```
			`cd pve_server`
			`python3 run_douzero.py`
			```
Add RLCard PvE 2021-05-29 04:43:44 +08:00			`Alternatively, you can start the PvE server interfaced with RLCard:`
			```
			`cd pve_server`
			`python3 run_dmc.py`
			```
			`They are conceptually the same with minor differences in state representation and training time of the pre-trained models (DouZero is fully trained with more than a month, while DMC in RLCard is only trained for hours).`

Upload PvE 2021-05-29 00:25:02 +08:00			`3. Run the following command in another new terminal under the project folder to start frontend:`
Fix list baseline bug and update README 2020-12-07 03:24:59 +08:00			```
			`npm start`
			```
Upload PvE 2021-05-29 00:25:02 +08:00			`You can view leaderboard at [http://127.0.0.1:3000/](http://127.0.0.1:3000/) and PvE demo of Dou Dizhu at [http://127.0.0.1:3000/pve/doudizhu-demo](http://127.0.0.1:3000/pve/doudizhu-demo). The backend of leaderboard will run in [http://127.0.0.1:8000/](http://127.0.0.1:8000/). The PvE backend will run in [http://127.0.0.1:5000/](http://127.0.0.1:5000/).`
Django init 2020-05-11 13:36:49 +08:00
Upload PvE 2021-05-29 00:25:02 +08:00			`## Demos`
clean code / reorganize doc structure 2020-12-19 03:10:48 +08:00			`![leaderboards](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/leaderboards.png?raw=true)`
			`![upload](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/upload.png?raw=true)`
			`![doudizhu-replay](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/doudizhu-replay.png?raw=true)`
			`![leduc-replay](https://github.com/datamllab/rlcard-showdown/blob/master/docs/imgs/leduc-replay.png?raw=true)`
Leduc 2020-05-12 05:42:08 +08:00
Upload PvE 2021-05-29 00:25:02 +08:00			`## Contact Us`
Add DQN 2021-05-28 14:15:41 +08:00			`If you have any questions or feedback, feel free to drop an email to [Songyi Huang](https://github.com/hsywhu) for the frontend or [Daochen Zha](https://github.com/daochenzha) for backend.`
Leduc 2020-05-12 05:42:08 +08:00
Upload PvE 2021-05-29 00:25:02 +08:00			`## Acknowledgements`
			`We would like to thank JJ World Network Technology Co., LTD for the generous support, [Chieh-An Tsai](https://anntsai.myportfolio.com/) for user interface design, and [Lei Pan](https://github.com/lpan18) for the help in visualizations.`