Scholar — deep learning

arxiv.org 📅 2021 📰 arXiv 📄 PDF

High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning

👤 Uta Pigorsch; Sebastian Schäfer

This paper proposes a Deep Reinforcement Learning algorithm for financial portfolio trading based on Deep Q-learning. The algorithm is capable of trading high-dimensional portfolios from cross-sectional datasets of any size which may include data gaps and non-unique history lengths in the assets. We sequentially set up…

q-fin.PM cs.LG q-fin.CP

arxiv.org 📅 2021 📰 arXiv 📄 PDF

A step toward a reinforcement learning de novo genome assembler

👤 Kleber Padovani; Roberto Xavier; Rafael Cabral Borges; Andre Carvalho; Anna Reali; Annie Chateau; Ronnie Alves

De novo genome assembly is a relevant but computationally complex task in genomics. Although de novo assemblers have been used successfully in several genomics projects, there is still no 'best assembler', and the choice and setup of assemblers still rely on bioinformatics experts. Thus, as with other computationally c…

q-bio.GN cs.AI cs.LG

arxiv.org 📅 2023 📰 arXiv 📄 PDF

Quantitative Trading using Deep Q Learning

👤 Soumyadip Sarkar

Reinforcement learning (RL) is a subfield of machine learning that has been used in many fields, such as robotics, gaming, and autonomous systems. There has been growing interest in using RL for quantitative trading, where the goal is to make trades that generate profits in financial markets. This paper presents the us…

q-fin.TR cs.LG q-fin.GN

DOI: 10.22214/ijraset.2023.50170

semanticscholar.org 📅 2025 📰 Nature 🔖 5,401 citations

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

👤 DeepSeek-AI; Daya Guo; Dejian Yang; Haowei Zhang; Jun-Mei Song; Ruoyu Zhang; R. Xu; Qihao Zhu; Shirong Ma; Peiyi Wang; Xiaoling Bi; Xiaokang Zhang; Xingkai Yu; Yu Wu; Z. F. Wu; Zhibin Gou; Zhihong Shao; Zhuoshu Li; Ziyi Gao; A. Liu; Bing Xue; Bing-Li Wang; Bochao Wu; B. Feng; Chengda Lu; Chenggang Zhao; C. Deng; Chenyu Zhang; C. Ruan; Damai Dai; Deli Chen; Dong-Li Ji; Erhang Li; Fangyun Lin; Fucong Dai; Fuli Luo; Guangbo Hao; Guanting Chen; Guowei Li; H. Zhang; Han Bao; Hanwei Xu; Haocheng Wang; Honghui Din

General reasoning represents a long-standing and formidable challenge in artificial intelligence (AI). Recent breakthroughs, exemplified by large language models (LLMs)1,2 and chain-of-thought (CoT) prompting3, have achieved considerable success on foundational reasoning tasks. However, this success is heavily continge…

DOI: 10.1038/s41586-025-09422-z

arxiv.org 📅 2023 📰 arXiv 📄 PDF

Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models

👤 Gang Hu

This study enhances a Deep Q-Network (DQN) trading model by incorporating advanced techniques like Prioritized Experience Replay, Regularized Q-Learning, Noisy Networks, Dueling, and Double DQN. Extensive tests on assets like BTC/USD and AAPL demonstrate superior performance compared to the original model, with marked …

q-fin.CP

arxiv.org 📅 2025 📰 arXiv 📄 PDF

Distributionally Robust Deep Q-Learning

👤 Chung I Lu; Julian Sester; Aijia Zhang

We propose a novel distributionally robust $Q$-learning algorithm for the non-tabular case accounting for continuous state spaces where the state transition of the underlying Markov decision process is subject to model uncertainty. The uncertainty is taken into account by considering the worst-case transition from a ba…

cs.LG math.OC q-fin.PM stat.ML

arxiv.org 📅 2023 📰 arXiv 📄 PDF

Enhancing Physics Learning with ChatGPT, Bing Chat, and Bard as Agents-to-Think-With: A Comparative Case Study

👤 Renato P. dos Santos

The rise of AI has brought remarkable advancements in education, with AI models demonstrating their ability to analyse and provide instructive solutions to complex problems. This study compared and analysed the responses of four Generative AI-powered chatbots (GenAIbots) - ChatGPT-3.5, ChatGPT-4, Bing Chat, and Bard - …

physics.ed-ph

arxiv.org 📅 2025 📰 arXiv 📄 PDF

Automated Trading System for Straddle-Option Based on Deep Q-Learning

👤 Yiran Wan; Xinyu Ying; Shengzhen Xu

Straddle Option is a financial trading tool that explores volatility premiums in high-volatility markets without predicting price direction. Although deep reinforcement learning has emerged as a powerful approach to trading automation in financial markets, existing work mostly focused on predicting price trends and mak…

q-fin.GN econ.GN

arxiv.org 📅 2024 📰 arXiv 📄 PDF

Generalizing Machine Learning Evaluation through the Integration of Shannon Entropy and Rough Set Theory

👤 Olga Cherednichenko; Dmytro Chernyshov; Dmytro Sytnikov; Polina Sytnikova

This research paper delves into the innovative integration of Shannon entropy and rough set theory, presenting a novel approach to generalize the evaluation approach in machine learning. The conventional application of entropy, primarily focused on information uncertainty, is extended through its combination with rough…

cs.LG

arxiv.org 📅 2024 📰 arXiv 📄 PDF

AI, Entrepreneurs, and Privacy: Deep Learning Outperforms Humans in Detecting Entrepreneurs from Image Data

👤 Martin Obschonka; Christian Fisch; Tharindu Fernando; Clinton Fookes

Occupational outcomes like entrepreneurship are generally considered personal information that individuals should have the autonomy to disclose. With the advancing capability of artificial intelligence (AI) to infer private details from widely available human-centric data (e.g., social media), it is crucial to investig…

cs.CV eess.IV

📚 UNiON Scholar