Skip to content

最新论文 - 2025年04月14日 #20

@github-actions

Description

@github-actions

最后更新:2025-04-14 00:02

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(199篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 Handling Delay in Real-Time Reinforcement Learning 2025-03-30
2 Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces 2025-02-25
3 Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation 2025-02-14
4 Improving Transformer World Models for Data-Efficient RL 2025-02-03
5 SLIM: Sim-to-Real Legged Instructive Manipulation via Long-Horizon Visuomotor Learning 2025-01-17
6 Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning 2024-10-29
7 Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL 2024-10-22
8 Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics 2024-06-17

2. partial observable markov decision process/pomdp

序号 标题 日期
1 An Efficient Reservation Protocol for Medium Access: When Tree Splitting Meets Reinforcement Learning 2025-04-03
2 Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding 2025-04-01
3 Real-time Tracking System with Partially Coupled Sources 2025-03-27
4 Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes 2025-03-25
5 Online Hybrid-Belief POMDP with Coupled Semantic-Geometric Models and Semantic Safety Awareness 2025-01-20
6 Parameter Adjustments in POMDP-Based Trajectory Planning for Unsignalized Intersections 2024-12-09
7 Induced Model Matching: Restricted Models Help Train Full-Featured Models 2024-02-19
8 Online POMDP Planning with Anytime Deterministic Optimality Guarantees 2023-10-03

3. sparse reward reinforcement learning

序号 标题 日期
1 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
2 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
3 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02
4 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning 2023-09-08
5 Language Reward Modulation for Pretraining Reinforcement Learning 2023-08-23
6 Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning 2022-09-27
7 Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning 2022-07-19

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 Relaxing the Markov Requirements on Reinforcement Learning Under Weak Partial Ignorability 2025-04-10
2 Better Decisions through the Right Causal World Model 2025-04-09
3 Causal Inference under Interference through Designed Markets 2025-04-09
4 OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning 2025-04-06
5 From Observation to Orientation: an Adaptive Integer Programming Approach to Intervention Design 2025-04-04
6 When Counterfactual Reasoning Fails: Chaos and Real-World Complexity 2025-03-31
7 A Causal Framework to Measure and Mitigate Non-binary Treatment Discrimination 2025-03-28
8 MASCOTS: Model-Agnostic Symbolic COunterfactual explanations for Time Series 2025-03-28
9 Constraint-based causal discovery with tiered background knowledge and latent variables in single or overlapping datasets 2025-03-27
10 A Contextual Approach to Technological Understanding and Its Assessment 2025-03-27
11 What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning 2025-03-27
12 Differentially Private Joint Independence Test 2025-03-24
13 Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering 2025-03-19
14 Addressing pitfalls in implicit unobserved confounding synthesis using explicit block hierarchical ancestral sampling 2025-03-12
15 Counterfactual Situation Testing: From Single to Multidimensional Discrimination 2025-02-03
16 Stabilized Inverse Probability Weighting via Isotonic Calibration 2024-11-10
17 Causal generalized linear models via Pearson risk invariance 2024-07-23
18 Prompting or Fine-tuning? Exploring Large Language Models for Causal Graph Validation 2024-05-29
19 Demystifying amortized causal discovery with transformers 2024-05-27
20 Separation-based distance measures for causal graphs 2024-02-07
21 Sample, estimate, aggregate: A recipe for causal discovery foundation models 2024-02-02
22 FedECA: A Federated External Control Arm Method for Causal Inference with Time-To-Event Data in Distributed Settings 2023-11-28
23 Beyond Conditional Averages: Estimating The Individual Causal Effect Distribution 2022-10-29

6. video super resolution

序号 标题 日期
1 FedVSR: Towards Model-Agnostic Federated Learning in Video Super-Resolution 2025-03-17
2 Blind Video Super-Resolution based on Implicit Kernels 2025-03-10
3 Implicit Neural Representation for Video and Image Super-Resolution 2025-03-06
4 Video Super-Resolution: All You Need is a Video Diffusion Model 2025-03-05
5 Low-Resource Video Super-Resolution using Memory, Wavelets, and Deformable Convolutions 2025-02-03
6 BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution 2025-01-19
7 DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations 2025-01-17
8 Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution 2024-10-15

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 Detect Anything 3D in the Wild 2025-04-10
2 SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement 2025-04-10
3 Siren Federate: Bridging document, relational, and graph models for exploratory graph analysis 2025-04-10
4 Automated Construction of a Knowledge Graph of Nuclear Fusion Energy for Effective Elicitation and Retrieval of Information 2025-04-10
5 Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation 2025-04-10
6 ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models 2025-04-10
7 CyberAlly: Leveraging LLMs and Knowledge Graphs to Empower Cyber Defenders 2025-04-10
8 LLM-Enabled Data Transmission in End-to-End Semantic Communication 2025-04-10
9 ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement 2025-04-10
10 WK-Pnet: FM-Based Positioning via Wavelet Packet Decomposition and Knowledge Distillation 2025-04-10
11 Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization 2025-04-09
12 Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer 2025-04-09
13 TabKAN: Advancing Tabular Data Analysis using Kolmograv-Arnold Network 2025-04-09
14 DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion 2025-04-09
15 Hyperbolic Category Discovery 2025-04-08
16 Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi 2025-04-08
17 A Behavior-Based Knowledge Representation Improves Prediction of Players' Moves in Chess by 25% 2025-04-07
18 Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models 2025-04-07
19 Universal Item Tokenization for Transferable Generative Recommendation 2025-04-06
20 Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible 2025-04-05
21 Quantifying Personality in Human-Drone Interactions for Building Heat Loss Inspection with Virtual Reality Training 2025-04-03
22 Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching 2025-04-03
23 F-ViTA: Foundation Model Guided Visible to Thermal Translation 2025-04-03
24 Affordable AI Assistants with Knowledge Graph of Thoughts 2025-04-03
25 How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence 2025-04-03
26 JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model 2025-04-03
27 A Diffusion-Based Framework for Occluded Object Movement 2025-04-02
28 AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge 2025-04-02
29 Generative Retrieval and Alignment Model: A New Paradigm for E-commerce Retrieval 2025-04-02
30 Towards Communication-Efficient Adversarial Federated Learning for Robust Edge Intelligence 2025-01-25
31 MedCT: A Clinical Terminology Graph for Generative AI Applications in Healthcare 2025-01-11
32 Neuro-Symbolic AI in 2024: A Systematic Review 2025-01-09
33 A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions 2024-12-12
34 Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis 2024-12-11
35 Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning 2024-09-27
36 A Model-Agnostic Approach for Semantically Driven Disambiguation in Human-Robot Interaction 2024-09-25
37 Large Language Model Enhanced Knowledge Representation Learning: A Survey 2024-07-01
38 Induced Model Matching: Restricted Models Help Train Full-Featured Models 2024-02-19

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Computational and Algebraic Structure of Board Games 2025-02-18
2 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
3 Temperatures of Robin Hood 2025-01-13
4 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
5 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
6 Maker-Breaker on Galton-Watson trees 2024-12-11
7 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
8 The Game Value of Sequential Compounds of Integers and Stars 2024-11-13
9 A New 0(klog n) Algorithm for Josephus Problem 2024-11-10
10 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
11 An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place Tasks using Reinforcement Learning 2024-09-20
12 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
13 Degrees are Useless in SNORT When Measuring Temperature 2024-06-04
14 Shogi and Frieze group 2023-11-15
15 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
16 Niel's Chess -- Rules for Xiangqi 2023-06-27
17 On the complexity of Dark Chinese Chess 2021-12-06
18 A Note on Hardness Frameworks and Computational Complexity of Xiangqi and Janggi 2019-03-30
19 Comparison Training for Computer Chinese Chess 2018-01-23

9. code llm

序号 标题 日期
1 OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs 2025-04-05
2 On Benchmarking Code LLMs for Android Malware Analysis 2025-04-01
3 Enhancing the Robustness of LLM-Generated Code: Empirical Study and Framework 2025-03-26
4 Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM 2025-03-22
5 Automated Harmfulness Testing for Code Large Language Models 2025-03-20
6 KnowCoder-X: Boosting Multilingual Information Extraction via Code 2024-11-07
7 SpecEval: Evaluating Code Comprehension in Large Language Models via Program Specifications 2024-09-19
8 CodeUpdateArena: Benchmarking Knowledge Editing on API Updates 2024-07-08

10. speech recognition

序号 标题 日期
1 Visual-Aware Speech Recognition for Noisy Scenarios 2025-04-09
2 RNN-Transducer-based Losses for Speech Recognition on Noisy Targets 2025-04-09
3 DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation 2025-04-07
4 F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization 2025-04-03
5 ValSub: Subsampling Validation Data to Mitigate Forgetting during ASR Personalization 2025-03-12
6 Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions 2025-01-22
7 DGSNA: prompt-based Dynamic Generative Scene-based Noise Addition method 2024-11-19
8 A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms 2023-06-27

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation 2025-04-10
2 DLTPose: 6DoF Pose Estimation From Accurate Dense Surface Point Estimates 2025-04-09
3 Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation 2025-04-09
4 Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM 2025-04-07
5 Learning Affine Correspondences by Integrating Geometric Constraints 2025-04-07
6 A Convex and Global Solution for the P$n$P Problem in 2D Forward-Looking Sonar 2025-04-06
7 Improving Indoor Localization Accuracy by Using an Efficient Implicit Neural Map Representation 2025-03-30
8 FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video 2025-03-29
9 DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios 2025-03-25
10 A Modular Edge Device Network for Surgery Digitalization 2025-03-18
11 Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance 2025-02-17
12 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting 2024-12-02
13 SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation 2024-11-07
14 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
15 Matching Anything by Segmenting Anything 2024-06-06
16 NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization 2024-06-01
17 Optimal Robot Formations: Balancing Range-Based Observability and User-Defined Configurations 2024-03-01
18 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
19 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
20 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
21 Next-generation Surgical Navigation: Marker-less Multi-view 6DoF Pose Estimation of Surgical Instruments 2023-05-05
22 Unifying Tracking and Image-Video Object Detection 2022-11-20
23 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
24 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 Objaverse++: Curated 3D Object Dataset with Quality Annotations 2025-04-09
2 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
3 Stochastic Ray Tracing of 3D Transparent Gaussians 2025-04-09
4 Flash Sculptor: Modular 3D Worlds from Objects 2025-04-08
5 An Empirical Study of GPT-4o Image Generation Capabilities 2025-04-08
6 3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization 2025-04-07
7 ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation 2025-04-03
8 3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting 2025-04-02
9 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models 2025-03-27
10 AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction 2025-03-17
11 Evolution 6.0: Evolving Robotic Capabilities Through Generative Design 2025-02-24
12 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
13 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
14 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
15 Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint 2024-10-20
16 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
17 RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models 2024-09-30
18 GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation 2024-09-27
19 TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes 2024-05-30
20 6Img-to-3D: Few-Image Large-Scale Outdoor Driving Scene Reconstruction 2024-04-18
21 DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling 2024-04-14
22 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 Efficient Formal Verification of Quantum Error Correcting Programs 2025-04-10
2 Cache-a-lot: Pushing the Limits of Unsatisfiable Core Reuse in SMT-Based Program Analysis 2025-04-10
3 Undecidability of the Emptiness Problem for Weak Models of Distributed Computing 2025-04-09
4 On Coalgebraic Product Constructions for Markov Chains and Automata 2025-04-09
5 Canonical for Automated Theorem Proving in Lean 2025-04-08
6 Leanabell-Prover: Posttraining Scaling in Formal Reasoning 2025-04-08
7 BoolE: Exact Symbolic Reasoning via Boolean Equality Saturation 2025-04-08
8 Bottom-Up Generation of Verilog Designs for Testing EDA Tools 2025-04-06
9 Quantifying Robustness: A Benchmarking Framework for Deep Learning Forecasting in Cyber-Physical Systems 2025-04-04
10 A 2-Categorical Bridge Between Henkin Constructions and Lawvere's Fixed-Point Theorem: Unifying Completeness and Compactness 2025-04-04
11 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
12 A Natural Transformation between the Model Constructions of the Completeness and Compactness Theorems, Enhanced by Rigidity and 2-Categorical Strengthening 2025-03-19
13 Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving 2025-03-12
14 Faithful Logic Embeddings in HOL -- A recipe to have it all: deep and shallow, automated and interactive, heavy and light, proofs and counterexamples, meta and object level 2025-02-26
15 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
16 A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation 2025-02-25
17 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
18 HunyuanProver: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving 2024-12-30
19 VeCoGen: Automating Generation of Formally Verified C Code with Large Language Models 2024-11-28
20 Learning Rules Explaining Interactive Theorem Proving Tactic Prediction 2024-11-02
21 Tableaux for Automated Reasoning in Dependently-Typed Higher-Order Logic (Extended Version) 2024-10-18
22 BAIT: Benchmarking (Embedding) Architectures for Interactive Theorem-Proving 2024-03-06
23 Trocq: Proof Transfer for Free, With or Without Univalence 2023-10-21
24 Magnushammer: A Transformer-Based Approach to Premise Selection 2023-03-08

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions