Relevance Newest Most Cited
51 results
arxiv.org πŸ“… 2021 πŸ“° arXiv πŸ“„ PDF
High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning
πŸ‘€ Uta Pigorsch; Sebastian SchΓ€fer

This paper proposes a Deep Reinforcement Learning algorithm for financial portfolio trading based on Deep Q-learning. The algorithm is capable of trading high-dimensional portfolios from cross-sectional datasets of any size which may include data gaps and non-unique history lengths in the assets. We sequentially set up…

q-fin.PM cs.LG q-fin.CP
arxiv.org πŸ“… 2021 πŸ“° arXiv πŸ“„ PDF
A step toward a reinforcement learning de novo genome assembler
πŸ‘€ Kleber Padovani; Roberto Xavier; Rafael Cabral Borges; Andre Carvalho; Anna Reali; Annie Chateau; Ronnie Alves

De novo genome assembly is a relevant but computationally complex task in genomics. Although de novo assemblers have been used successfully in several genomics projects, there is still no 'best assembler', and the choice and setup of assemblers still rely on bioinformatics experts. Thus, as with other computationally c…

q-bio.GN cs.AI cs.LG
arxiv.org πŸ“… 2023 πŸ“° arXiv πŸ“„ PDF
Quantitative Trading using Deep Q Learning
πŸ‘€ Soumyadip Sarkar

Reinforcement learning (RL) is a subfield of machine learning that has been used in many fields, such as robotics, gaming, and autonomous systems. There has been growing interest in using RL for quantitative trading, where the goal is to make trades that generate profits in financial markets. This paper presents the us…

q-fin.TR cs.LG q-fin.GN
DOI: 10.22214/ijraset.2023.50170
semanticscholar.org πŸ“… 2025 πŸ“° Nature πŸ”– 5,401 citations
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
πŸ‘€ DeepSeek-AI; Daya Guo; Dejian Yang; Haowei Zhang; Jun-Mei Song; Ruoyu Zhang; R. Xu; Qihao Zhu; Shirong Ma; Peiyi Wang; Xiaoling Bi; Xiaokang Zhang; Xingkai Yu; Yu Wu; Z. F. Wu; Zhibin Gou; Zhihong Shao; Zhuoshu Li; Ziyi Gao; A. Liu; Bing Xue; Bing-Li Wang; Bochao Wu; B. Feng; Chengda Lu; Chenggang Zhao; C. Deng; Chenyu Zhang; C. Ruan; Damai Dai; Deli Chen; Dong-Li Ji; Erhang Li; Fangyun Lin; Fucong Dai; Fuli Luo; Guangbo Hao; Guanting Chen; Guowei Li; H. Zhang; Han Bao; Hanwei Xu; Haocheng Wang; Honghui Din

General reasoning represents a long-standing and formidable challenge in artificial intelligence (AI). Recent breakthroughs, exemplified by large language models (LLMs)1,2 and chain-of-thought (CoT) prompting3, have achieved considerable success on foundational reasoning tasks. However, this success is heavily continge…

DOI: 10.1038/s41586-025-09422-z
arxiv.org πŸ“… 2023 πŸ“° arXiv πŸ“„ PDF
Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models
πŸ‘€ Gang Hu

This study enhances a Deep Q-Network (DQN) trading model by incorporating advanced techniques like Prioritized Experience Replay, Regularized Q-Learning, Noisy Networks, Dueling, and Double DQN. Extensive tests on assets like BTC/USD and AAPL demonstrate superior performance compared to the original model, with marked …

q-fin.CP
arxiv.org πŸ“… 2025 πŸ“° arXiv πŸ“„ PDF
Distributionally Robust Deep Q-Learning
πŸ‘€ Chung I Lu; Julian Sester; Aijia Zhang

We propose a novel distributionally robust $Q$-learning algorithm for the non-tabular case accounting for continuous state spaces where the state transition of the underlying Markov decision process is subject to model uncertainty. The uncertainty is taken into account by considering the worst-case transition from a ba…

cs.LG math.OC q-fin.PM stat.ML
arxiv.org πŸ“… 2023 πŸ“° arXiv πŸ“„ PDF
Enhancing Physics Learning with ChatGPT, Bing Chat, and Bard as Agents-to-Think-With: A Comparative Case Study
πŸ‘€ Renato P. dos Santos

The rise of AI has brought remarkable advancements in education, with AI models demonstrating their ability to analyse and provide instructive solutions to complex problems. This study compared and analysed the responses of four Generative AI-powered chatbots (GenAIbots) - ChatGPT-3.5, ChatGPT-4, Bing Chat, and Bard - …

physics.ed-ph
arxiv.org πŸ“… 2025 πŸ“° arXiv πŸ“„ PDF
Automated Trading System for Straddle-Option Based on Deep Q-Learning
πŸ‘€ Yiran Wan; Xinyu Ying; Shengzhen Xu

Straddle Option is a financial trading tool that explores volatility premiums in high-volatility markets without predicting price direction. Although deep reinforcement learning has emerged as a powerful approach to trading automation in financial markets, existing work mostly focused on predicting price trends and mak…

q-fin.GN econ.GN
arxiv.org πŸ“… 2024 πŸ“° arXiv πŸ“„ PDF
Generalizing Machine Learning Evaluation through the Integration of Shannon Entropy and Rough Set Theory
πŸ‘€ Olga Cherednichenko; Dmytro Chernyshov; Dmytro Sytnikov; Polina Sytnikova

This research paper delves into the innovative integration of Shannon entropy and rough set theory, presenting a novel approach to generalize the evaluation approach in machine learning. The conventional application of entropy, primarily focused on information uncertainty, is extended through its combination with rough…

cs.LG
arxiv.org πŸ“… 2024 πŸ“° arXiv πŸ“„ PDF
AI, Entrepreneurs, and Privacy: Deep Learning Outperforms Humans in Detecting Entrepreneurs from Image Data
πŸ‘€ Martin Obschonka; Christian Fisch; Tharindu Fernando; Clinton Fookes

Occupational outcomes like entrepreneurship are generally considered personal information that individuals should have the autonomy to disclose. With the advancing capability of artificial intelligence (AI) to infer private details from widely available human-centric data (e.g., social media), it is crucial to investig…

cs.CV eess.IV
Also search: arXiv β†’ PubMed β†’ Semantic Scholar β†’ Google Scholar β†’