Assaf Hallak

Cited by

	All	Since 2019
Citations	703	512
H-index	9	8
i10-index	8	8

Citations per year
Year	2014	2015	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	3	6	19	25	21	38	67	81	93	203	30

Public access (based on funding mandates)
Available	3 articles
Not available	0 articles

Co-authors

Name	Affiliation	Verified email domain
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research	technion.ac.il
Gal Dalal	Sr. Research Scientist, Nvidia	nvidia.com
Georgios Theocharous	Adobe Research	adobe.com
Gal Chechik	Bar Ilan University, NVIDIA	biu.ac.il
Dotan Di Castro	Research Manager at Bosch-AI, Haifa, Israel	il.bosch.com
Aviv Tamar	Technion	technion.ac.il
Timothy A Mann	Meta	fb.com
François Schnitzler	Senior Scientist, InterDigital	interdigital.com
Rémi Munos	DeepMind	inria.fr
Elad Yom-Tov	Bar Ilan University	yom-tov.info
Noam Koenigstein	Tel-Aviv University	tauex.tau.ac.il
Yishay Mansour	Tel Aviv University	tauex.tau.ac.il

Assaf Hallak

NVIDIA Research

Verified email at nvidia.com

Reinforcement Learning


Title	Authors	Venue	Cited by	Year
Contextual Markov decision processes	A Hallak, D Di Castro, S Mannor	arXiv preprint arXiv:1502.02259, 2015	212	2015
Lifetime value marketing using reinforcement learning	G Theocharous, A Hallak	RLDM 2013, 19, 2013	196	2013
Consistent on-line off-policy evaluation	A Hallak, S Mannor	International Conference on Machine Learning, 1372-1383, 2017	104	2017
Generalized emphatic temporal difference learning: Bias-variance analysis	A Hallak, A Tamar, R Munos, S Mannor	Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	55	2016
Off-policy model-based learning under unknown factored dynamics	A Hallak, F Schnitzler, T Mann, S Mannor	International Conference on Machine Learning, 711-719, 2015	40	2015
Model selection in Markovian processes	A Hallak, D Di-Castro, S Mannor	Proceedings of the 19th ACM SIGKDD international conference on Knowledge …, 2013	28	2013
Cumulative success-based recommendations for repeat users	E Yom-Tov, A Hallak, N Koenigstein	US Patent App. 15/605,525, 2018	17	2018
On covariate shift of latent confounders in imitation and reinforcement learning	G Tennenholtz, A Hallak, G Dalal, S Mannor, G Chechik, U Shalit	arXiv preprint arXiv:2110.06539, 2021	13	2021
System identification framework	G Theocharous, AJ Hallak	US Patent 10,558,987, 2020	9	2020
Improve agents without retraining: Parallel tree search with off-policy correction	G Dalal, A Hallak, S Dalton, S Mannor, G Chechik	Advances in Neural Information Processing Systems 34, 5518-5530, 2021	8	2021
Planning and learning with adaptive lookahead	A Rosenberg, A Hallak, S Mannor, G Chechik, G Dalal	Proceedings of the AAAI Conference on Artificial Intelligence 37 (8), 9606-9613, 2023	5	2023
Emphatic TD Bellman operator is a contraction	A Hallak, A Tamar, S Mannor	arXiv preprint arXiv:1508.03411, 2015	5	2015
Automatic representation for lifetime value recommender systems	A Hallak, Y Mansour, E Yom-Tov	arXiv preprint arXiv:1702.07125, 2017	4	2017
Testing a marketing strategy offline using an approximate simulator	A Hallak, G Theocharous	US Patent App. 14/080,038, 2015	3	2015
Reinforcement learning with a terminator	G Tennenholtz, N Merlis, L Shani, S Mannor, U Shalit, G Chechik, ...	Advances in Neural Information Processing Systems 35, 35696-35709, 2022	2	2022
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search	G Dalal, A Hallak, G Thoppe, S Mannor, G Chechik	arXiv preprint arXiv:2301.13236, 2023	1	2023
Off-policy evaluation for MDPs with unknown structure	A Hallak, F Schnitzler, T Mann, S Mannor	arXiv preprint arXiv:1502.03255, 2015	1	2015
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Expansion	G Dalal, A Hallak, G Thoppe, S Mannor, G Chechik			2023
Adaptive lookahead for planning and learning	S Mannor, G Chechik, G Dalal, AJ Hallak, A Rosenberg	US Patent App. 18/158,920, 2023		2023
On the Products of Stochastic and Diagonal Matrices	A Hallak, G Dalal	arXiv preprint arXiv:2304.11634, 2023		2023
