reinforcement search results

Model-based deep reinforcement learning for accelerated learning from flow simulations arxiv.org \| Yesterday	Summary: In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a maj... Keywords: test, reinforcement learning, optimization
Decentralized fused-learner architectures for Bayesian reinforcement learning www.sciencedirect.com \| Today	Summary: Publication date June 2024Source Artificial Intelligence, Volume 331Author s Augustin A. Saucan, Subhro Das, Moe Z. Win... Keywords: reinforcement learning, artificial intelligence
Using deep reinforcement learning to promote sustainable human behaviour on a common pool resource problem arxiv.org \| Yesterday	Summary: A canonical social dilemma arises when finite resources are allocated to a group of people, who can choose to either reciprocate with interest, or keep the proceeds for themselves. What resource allocation mechanisms will encourage levels of reciprocation that sustain the commons? Here, in an iterated multiplayer trust game, we use deep reinforcement learning (RL) to design an allocation mechanism that endogenously promotes sustainable contributions from human participants to a common pool resou... Keywords: rust, design, game, rl ,
Non-ergodicity in reinforcement learning: robustness via ergodicity transformations arxiv.org \| Yesterday	Summary: Envisioned application areas for reinforcement learning (RL) include autonomous driving, precision agriculture, and finance, which all require RL agents to make decisions in the real world. A significant challenge hindering the adoption of RL methods in these domains is the non-robustness of conventional algorithms. In this paper, we argue that a fundamental issue contributing to this lack of robustness lies in the focus on the expected value of the return as the sole ``correct'' optimization ob... Keywords: rl , algorithms, reinforcement learning,
Top 30 Machine Learning Interview Questions in 2024 www.analyticsinsight.net \| Today	Summary: Master Machine Learning interviews top 30 questions for 2024 Machine Learning is branch of AI that empowers interaction with data rather than programming to predict more accurate results using historical data as base. The value of big data for enter... Keywords: regression, unsupervised, correlation, design, artificial
/mesophil/ Reinforcement Learning with Generative Models for Compact Support Sets paperswithcode.com \| Today	Summary: In this work, we propose framework utilizing reinforcement learning as control for foundation models, allowing for the granular generation of small, focused synthetic support sets to augment the performance of neural network models on real data class... Keywords: reinforcement learning, generative, network, generative
The Power of Resets in Online Reinforcement Learning arxiv.org \| Yesterday	Summary: Simulators are a pervasive tool in reinforcement learning, but most existing algorithms cannot efficiently exploit simulator access -- particularly in high-dimensional domains that require general function approximation. We explore the power of simulators through online reinforcement learning with {local simulator access} (or, local planning), an RL protocol where the agent is allowed to reset to previously observed states and follow their dynamics during training. We use local simulator access ... Keywords: algorithms, reinforcement learning, rl
Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfol... arxiv.org \| Yesterday	Summary: Deep or reinforcement learning (RL) approaches have been adapted as reactive agents to quickly learn and respond with new investment strategies for portfolio management under the highly turbulent financial market environments in recent years. In many cases, due to the very complex correlations among various financial sectors, and the fluctuating trends in different financial markets, a deep or reinforcement learning based agent can be biased in maximising the total returns of the newly formulate... Keywords: react, rl , reinforcement learning,
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey arxiv.org \| Yesterday	Summary: Bipedal robots are garnering increasing global attention due to their potential applications and advancements in artificial intelligence, particularly in Deep Reinforcement Learning (DRL). While DRL has driven significant progress in bipedal locomotion, developing a comprehensive and unified framework capable of adeptly performing a wide range of tasks remains a challenge. This survey systematically categorizes, compares, and summarizes existing DRL frameworks for bipedal locomotion, organizing ... Keywords: artificial intelligence, reinforcement learning, framework
Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods arxiv.org \| Yesterday	Summary: This paper presents a novel learning approach for Dubins Traveling Salesman Problems(DTSP) with Neighborhood (DTSPN) to quickly produce a tour of a non-holonomic vehicle passing through neighborhoods of given task points. The method involves two learning phases: initially, a model-free reinforcement learning approach leverages privileged information to distill knowledge from expert trajectories generated by the LinKernighan heuristic (LKH) algorithm. Subsequently, a supervised learning phase tra... Keywords: supervised learning, reinforcement learning
SOTOPIA-$$: Interactive Learning of Socially Intelligent Language Agents arxiv.org \| Yesterday	Summary: Humans learn social skills through both imitation and social interaction. This social learning process is largely understudied by existing research on building language agents. Motivated by this gap, we propose an interactive learning method, SOTOPIA-$\pi$, improving the social intelligence of language agents. This method leverages behavior cloning and self-reinforcement training on filtered social interaction data according to large language model (LLM) ratings. We show that our training method... Keywords: ...
Evaluating Collaborative Autonomy in Opposed Environments using Maritime Capture-the-Flag Competitions arxiv.org \| Yesterday	Summary: The objective of this work is to evaluate multi-agent artificial intelligence methods when deployed on teams of unmanned surface vehicles (USV) in an adversarial environment. Autonomous agents were evaluated in real-world scenarios using the Aquaticus test-bed, which is a Capture-the-Flag (CTF) style competition involving teams of USV systems. Cooperative teaming algorithms of various foundations in behavior-based optimization and deep reinforcement learning (RL) were deployed on these USV syste... Keywords: test, artificial intelligence, rl ,
TD3-BST: A Machine Learning Algorithm to Adjust the Strength of Regularization Dynamically Using Uncertai... www.marktechpost.com \| Yesterday	Summary: img width 696 height 711 src class attachment large size large wp post image alt style float left margin 0 15px 15px 0 decoding async loading lazy srcset 1003w, 294w, 768w, 411w, 150w, 300w, 696w, 356w, 24w, 48w, 1054w sizes m... Keywords: machine learning, hyperparameter, reinforcement learning
Large Language Models are Learnable Planners for Long-Term Recommendation dx.doi.org \| Yesterday	Summary: Planning for both immediate and long-term benefits becomes increasingly important in recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning capacity by maximizing cumulative reward for long-term recommendation. However, the scarcity of recommendation data presents challenges such as instability and susceptibility to overfitting when training RL models from scratch, resulting in sub-optimal performance. In this light, we propose to leverage the remarkable planning ca... Keywords: reinforcement learning, rl
Top Artificial Intelligence AI Courses for Beginners in 2024 www.marktechpost.com \| Yesterday	Summary: The popularity of AI has skyrocketed in the past few years, with new avenues being opened up with the rise in the use of large language models LLMs . Having knowledge of AI has now become quite essential as recruiters are actively looking for candid... Keywords: neural network, design, algorithms, supervised
What AIs are not Learning (and Why): Bio-Inspired Foundation Models for Robots arxiv.org \| Yesterday	Summary: It is hard to build robots (including telerobots) that are useful, and harder to build autonomous robots that are robust and general. Current robots are built using manual programming, mathematical models, planning frameworks, and reinforcement learning. These methods do not lead to the leaps in performance and generality seen with deep learning, generative AI, and foundation models (FMs). Today's robots do not learn to provide home care, to be nursing assistants, or to do household chores and o... Keywords: framework, programming, deep learning, ai
CMSs are everywhere and they go beyond publishing dev.to \| Yesterday	Summary: Content Management Systems CMSs are the most important and ubiquitous technologies on the internet. They are essential for todays digital environment. Where there are people, there is content. And where there is content, there is CMS. If you need t... Keywords: course, optimization, hive, ai
PST: Improving Quantitative Trading via Program Sketch-based Tuning arxiv.org \| Yesterday	Summary: Deep reinforcement learning (DRL) has revolutionized quantitative finance by achieving decent performance without significant human expert knowledge. Despite its achievements, we observe that the current state-of-the-art DRL models are still ineffective in identifying the market trend, causing them to miss good trading opportunities or suffer from large drawdowns when encountering market crashes. To tackle this limitation, a natural idea is to embed human expert knowledge regarding the market tr... Keywords: quant, reinforcement learning
Quantum Multi-Agent Reinforcement Learning for Aerial Ad-hoc Networks arxiv.org \| Yesterday	Summary: Quantum machine learning (QML) as combination of quantum computing with machine learning (ML) is a promising direction to explore, in particular due to the advances in realizing quantum computers and the hoped-for quantum advantage. A field within QML that is only little approached is quantum multi-agent reinforcement learning (QMARL), despite having shown to be potentially attractive for addressing industrial applications such as factory management, cellular access and mobility cooperation. Thi... Keywords: machine learning, quantum comp, quant,
Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents arxiv.org \| Yesterday	Summary: Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling specific target problems, particularly in real-time dynamic environments. Additionally, deploying an LLM-based agent in practical scenarios can be both costly and time-consuming. On the other hand, reinforcement learning (RL) approaches train agents that specia... Keywords: reinforcement learning, ios, rl

Please log in to see more search results.

reinforcement - 20 / 53