
Deep RL expert needed to guide me in my project

min €36 EUR / hour

Closed
Posted about 3 years ago

Hello,

I have the following personal ML project:
1. Modeling a sort of single-player game with a big board of locations, pawns of different colors and quantities to place on it, and a set of rules for placing each kind (color) of pawn on the board. There are also time considerations, as moving a pawn from one location to another takes some time related to the distance.
2. Having a Deep RL algorithm learn to play the game and find the best solution (highest score in the smallest number of moves).

I have only been into machine learning for a few weeks, starting with OpenAI Gym. So far I have programmed the gym environment, which works with a random agent (I mean the rules work correctly). I tried to train PPO on it, but I am not sure my strategy is the right one regarding action/observation spaces and rewards. I did not really do any normalization, as I am not sure how to handle it.

I am stuck at this stage with a ton of questions and doubts, and I need an expert to coach me and point me in the right direction. I don't have much time for trial and error, but I want to learn how to tackle my specific problem. I might also need some help with coding when necessary (I code in Python).

Thanks,
Johan

Brief summary of what my gym env looks like:

I coded a gym environment for a single-player game that consists of:
- a board of n locations (n is 1122 in this example, but in the future I would like to be able to handle boards of, for instance, 30000 locations), represented by a simple list of 1122 indexes
- 6 different kinds (colors) of pawns that you can place according to a set of rules handled by the gym environment (some pawns can pile up on the same location, etc.). At the beginning, the player has a fixed number of available pawns per color (stock), which I represent by the codes 1 to 6.
- 3 possible pawn actions: NEW, when putting a new pawn on the board from the stock; MOVE, when moving a pawn already on the board to a new location; and REMOVE, when removing a pawn from the board to put it back in the stock.

As an action_space I used a MultiDiscrete([pawn_actions_nr, total_pawns_nr, locations_nr]), where:
- pawn_actions_nr = 0 (NEW), 1 (MOVE) or 2 (REMOVE)
- total_pawns_nr = int from 0 to 60, with 0 to 6 being the 7 black pawns, 7 to 10 the 4 red pawns, and so on
- locations_nr = 0 to 1122, representing each of the 1122 possible locations

Every time a pawn stops at a location, the location takes its color (e.g., I can place a red pawn on a given location and then move it to different locations; all these locations will turn red).

Observation space: a Box of length (1122+60), whose values can be integers from -1 to 1122. The first 1122 entries are indexed by location and take values from 0 to 6 (0 being the initial state and 1 to 6 the color of the location). The last 60 entries represent all the available pawns from the initial stock, with a possible value from -1 to 1122 representing the location where a given pawn is located, -1 meaning that it is not on the board but still in stock.

The environment does not manage time so far, as I am not sure how to handle the moving delay. Do I have to set a fixed time per step and somehow manage past, present and future? Is there a way to handle that as discrete simulations do?
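One possible way to treat the moving delay the way discrete-event simulations do is sketched below: instead of a fixed time per step, the clock jumps straight to the next pawn arrival. This is only an illustration under assumptions, not part of the posted environment. The names `MoveClock`, `schedule_move`, `advance` and `travel_time` are hypothetical, and the distance model (locations laid out on a line) is a placeholder for whatever distance rule the real board uses.

```python
import heapq


class MoveClock:
    """Event-driven clock: time jumps to the next pawn arrival instead of ticking."""

    def __init__(self):
        self.now = 0.0
        # Min-heap of (arrival_time, pawn_id, destination); earliest arrival first.
        self._arrivals = []

    def schedule_move(self, pawn_id, destination, travel_time):
        """Register a move that completes `travel_time` after the current time."""
        heapq.heappush(self._arrivals, (self.now + travel_time, pawn_id, destination))

    def advance(self):
        """Jump to the earliest pending arrival and return it, or None if idle."""
        if not self._arrivals:
            return None
        arrival_time, pawn_id, destination = heapq.heappop(self._arrivals)
        self.now = arrival_time
        return pawn_id, destination


def travel_time(src, dst, speed=1.0):
    """Hypothetical distance model: locations on a line, time = |dst - src| / speed."""
    return abs(dst - src) / speed


clock = MoveClock()
# Two pawns start moving at t = 0 from location 10.
clock.schedule_move(pawn_id=7, destination=40, travel_time=travel_time(10, 40))
clock.schedule_move(pawn_id=0, destination=15, travel_time=travel_time(10, 15))

first = clock.advance()        # the shorter move completes first
time_after_first = clock.now
second = clock.advance()       # then the longer move completes
```

In an environment built this way, a MOVE action could call `schedule_move`, and each `step` could call `advance` to reach the next decision point. The elapsed time (`clock.now`) could then feed into the reward or the observation, which is a design choice rather than a requirement.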
Project ID: 28938998

About the project

8 proposals
Remote project
Active 3 yrs ago

8 freelancers are bidding on average €42 EUR/hour for this job
Hi, I am Ibrahim, a data scientist. I can help you with RL; please share what the desired variation is. Regards, Ibrahim Anjum
€36 EUR in 10 days
4.9 (50 reviews)
5.8
Hello jomo78, We have 20 years of strong experience in Python, Machine Learning (ML), and Deep Learning; as a result, we can successfully complete this project. Please review our profile at https://www.freelancer.com/u/tangramua for detailed information about our company, our portfolio, and recent client reviews. We would also like to discuss your project with you personally, which will help us give you an accurate estimate. Best regards, Tangram Canada Inc. P.S. As seen from the details of your profile, you are located in Europe. Please note that our technical offices are located in Ukraine, so you will be able to work with our developers in almost the same time zone.
€79 EUR in 5 days
4.7 (12 reviews)
5.8
Hi there. I read your description and I am interested in your project. I am the ✪Deep Learning/Machine Learning/Python✪ expert you are looking for, with 7+ years of experience. I am familiar with many Python modules and algorithms. I am confident I can finish your project properly and on time. Working with me, you will have a good experience and save a lot of time. Please contact me so we can discuss it in more detail. Thank you. Milos
€36 EUR in 40 days
5.0 (20 reviews)
5.1
Hi, I hope you are doing fine. I have almost 10 years of experience in machine learning algorithms. I can implement various types of artificial intelligence algorithms, including yours, with Matlab, Python, etc. I have a PhD from Tohoku University and several journal publications on these subjects. You can see my portfolio for my previous projects. I read about your project and am interested in working with you. Please send me a message so that we can discuss it further. Best regards.
€36 EUR in 40 days
5.0 (4 reviews)
3.8
I am a high school student and also a Machine Learning Engineer with an incredibly curious urge to apply AI to day-to-day modern problems. Skills: Python, C++ and Golang developer, TensorFlow Developer, PyTorch, Computer Vision, Neural Networks, Linux, Docker, Kubernetes, Google Cloud Platform, Data Engineering, Competitive Programmer, Data Interpretation
€36 EUR in 40 days
0.0 (0 reviews)
0.0
Hi, We are a team of data science and ML/AI experts who excel across multiple areas, with more than twenty-five years of combined experience. We hold expertise in Python, Backend Architecture (micro-services, Kubernetes), Data Science, Machine Learning, NLP, Deep Learning (CNN, RNN/LSTM/GRU, Attention-based architectures), Reinforcement Learning, Time Series Analysis, Data Visualisations, DevOps, Web scraping and Big Data including Hadoop, Spark, MongoDb, Elastic Search, Redis and NoSQL databases. We have worked with several MNCs in the domains of Travel, Telecom, Media, Oil & Gas, Messaging and other verticals. We have developed products driven by AI technology innovations, with full-stack end-to-end design and development on Cloud (AWS + GCP) over terabytes of transactional and clickstream data. For details about our capabilities and the projects we have completed, you can check our profile. Looking forward to assisting you. Warm Regards.
€40 EUR in 40 days
0.0 (0 reviews)
0.0

About the client

Flag of FRANCE
Annecy, France
0.0
0
Payment method verified
Member since Jan 15, 2021

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)