Solving the Travelling Salesperson Problem with deep reinforcement learning on Amazon SageMaker

Introduction

The Travelling Salesperson Problem (TSP) is one of the most popular NP-hard combinatorial problems in the theoretical computer science and operations research (OR) community. It asks the following question: "Given a list of cities and the distances between each pair of cities, what is the shortest possible route that visits each city exactly once and returns to the origin city?".

The problem has been studied for decades, and many traditional optimization algorithms have been proposed to solve it, such as dynamic programming and branch-and-bound. Although these optimization algorithms are capable of solving TSP with dozens of nodes, it is usually intractable to use these algorithms to solve optimally above thousands of nodes on modern computers due to their exponential execution times.

In this repository, we demonstrate show how to train, deploy, and make inferences using deep reinforcement learning to solve the Travelling Salesperson Problem.

For additional explanation, see the forthcoming blog post: Solving the Travelling Salesperson Problem with deep reinforcement learning on Amazon SageMaker

Getting Started

1. Create a SageMaker notebook instance.

This repository is meant to be run on a SageMaker notebook instance. For details on how to create a notebook instance, see the aws documentation.

2. Clone the repository with submodule into the SageMaker directory.

This will clone the current repository as well as the submodule repository: learning-tsp.

cd SageMaker
git clone --recurse-submodules https://github.com/aws-samples/amazon-sagemaker-tsp-deep-rl.git
cd amazon-sagemaker-tsp-deep-rl

From here on out, scripts are to be run from the git project root.

3. Create the virtual environment and install dependencies.

scripts/build_env.sh

This would be a good time to grab a coffee or tea. This step takes a few minutes to run. This step does not need to be repeated on notebook restart (see Restarting the notebook).

4. Combine relevant files into single source directory for SageMaker.

This will combine all of the training and inference code in a single source directory and create a model.tar.gz file for inference with a pre-trained model.

scripts/set_up_sagemaker.sh

Training (Optional)

Open the notebook named notebooks/pytorch_training.ipynb to see how to train on multiple GPU nodes on SageMaker.

Note that this step is optional.

To run training you need to have 18-19 GB of available disk space on your notebook instance to download the training data.

Inference

Open the notebook titled notebooks/pytorch_inference.ipynb to see how to run inference in three different ways:

Locally on the notebook instance
SageMaker Endpoint
Batch Transform

Streamlit Demo

1. Update the Jupyter Notebook instance environment for hosting streamlit.

scripts/set_up_streamlit.sh

2. Run the steamlit app.

WORKING_DIR=./.myenv
# get the env name
line=$(head -n 1 environment.yml)
ENV_NAME="${line/name:\ /}"
source "$WORKING_DIR/miniconda/bin/activate"
conda activate $ENV_NAME

streamlit run src/streamlit_demo.py

3. View the streamlit app via a browser.

Go to https://$YourInstance$.notebook.$YourRegion$.sagemaker.aws/proxy/8501/

Restarting the notebook

After you start/stop a SageMaker notebook instance, you do not need to re-install the packages. Simply open a SageMaker terminal session and run:

cd SageMaker/amazon-sagemaker-tsp-deep-rl
scripts/start_env.sh

Acknowledgements

This code base is an extension of Chaitanya Joshi's excellent repo learning-tsp.

For additional detail check out the paper (Joshi et al., 2021): Learning TSP Requires Rethinking Generalization.

Security

See CONTRIBUTING for more information.

License

This code is licensed under the MIT-0 License. See the LICENSE file.

This code downloads and installs Miniconda. See here for the end-user-license-agreement.

Name	Name	Last commit message	Last commit date
Latest commit yinsong1986 Update README.md Sep 27, 2021 6832c61 · Sep 27, 2021 History 44 Commits
learning-tsp @ bb28f57	learning-tsp @ bb28f57	Reset learning-tsp to upstream master	Sep 2, 2021
notebooks	notebooks	Update data download directory in model training	Sep 9, 2021
scripts	scripts	Minor comment update	Sep 14, 2021
src	src	Move scripts into their own directory	Sep 3, 2021
.gitignore	.gitignore	Add python nb checkpoints to gitignore	Sep 3, 2021
.gitmodules	.gitmodules	Update README	Aug 30, 2021
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Initial commit	Aug 26, 2021
CONTRIBUTING.md	CONTRIBUTING.md	Initial commit	Aug 26, 2021
LICENSE	LICENSE	Initial commit	Aug 26, 2021
README.md	README.md	Update README.md	Sep 27, 2021
environment.yml	environment.yml	Make all notebooks to run in custom kernal	Sep 8, 2021
miniconda-eula.txt	miniconda-eula.txt	Add miniconda eula	Aug 30, 2021
requirements.txt	requirements.txt	Add requirements.txt for setup	Sep 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Solving the Travelling Salesperson Problem with deep reinforcement learning on Amazon SageMaker

Introduction

Getting Started

Training (Optional)

Inference

Streamlit Demo

Restarting the notebook

Acknowledgements

Security

License

Authors

About

Releases

Packages

Contributors 4

Languages

License

aws-samples/amazon-sagemaker-tsp-deep-rl

Folders and files

Latest commit

History

Repository files navigation

Solving the Travelling Salesperson Problem with deep reinforcement learning on Amazon SageMaker

Introduction

Getting Started

Training (Optional)

Inference

Streamlit Demo

Restarting the notebook

Acknowledgements

Security

License

Authors

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages