Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas. Read more

About me Read more

Page not in menu

This is a page not in th emain menu Read more

Jupyter notebook markdown generator

Posts

Deep Reinforcement Learning: Model Based Reinforcement Learning

less than 1 minute read

Published: July 18, 2020

Read more

Deep Reinforcement Learning: Policy Gradient and Actor-Critic

11 minute read

Published: June 16, 2020

In this post, we review the basic policy gradient algorithm for deep reinforcement learning and the actor-critic algorithm. Most of the contents are derived from CS 285 at UC Berkeley. Read more

Theory of Optimization: More on Mirror Descent

2 minute read

Published: February 15, 2019

In this post, we will continue on our discuss of mirror descent. We will present a variant of mirror descent: the lazy mirror descent, also known as Nesterov’s dual averaging. Read more

Theory of Optimization: Frank-Wolfe Algorithm

2 minute read

Published: February 13, 2019

In this post, we describe a new geometry dependent algorithm that relies on different set of assumptions. The algorithm is called conditional gradient descent, aka Frank-Wolfe. Read more

Theory of Optimization: Mirror Descent

7 minute read

Published: February 06, 2019

In this post, we will introduce the Mirror Descent algorithm that solves the convex optimization algorithm. Read more

Theory of Optimization: Projected (Sub)Gradient Descent

6 minute read

Published: February 04, 2019

In this post, we will continue our analysis for gradient descent. Different from the previous post, we will not assume that the function is smooth. We will only assume that the function is convex and has some Lipschitz constant. Read more

Theory of Optimization: Gradient Descent

6 minute read

Published: February 03, 2019

In this post, we will review the most basic and the most intuitive optimization method – the gradient decent method – in optimization. Read more

Theory of Optimization: Preliminaries and Basic Properties

7 minute read

Published: January 31, 2019

Recently, I find an interesting course taught by Prof. Yin Tat Lee at UW. The course is called `Theory of Optimization and Continuous Algorithms’, and the lecture notes are available under the homepage of this courseuw-cse535-winter19. As a great fan of optimization theory and algorithm design, I think I will follow this course and write a bunch of blogs to record my study of this course. Most of the materials in this series of blogs will follow the lecture notes of the course, and and interesting optimization book Convex Optimization: Algorithms and Complexity by Sebastien Bubeck. Since this is the first blog about this course, I will present the preliminaries of the optimization theory, and some basic knowledge about convex optimization, including some basic properties of convex functions. Read more

portfolio

Portfolio item number 1

Short description of portfolio item number 1
Read more

Portfolio item number 2

Short description of portfolio item number 2
Read more

publications

An FPTAS for Stochastic Unbounded Min-Knapsack Problem

Published in International Frontiers of Algorithmics Workshop, 2019

Read more

Download here

Stochastic One-Sided Full-Information Bandit

Published in The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2019

Read more

Download here

Gradient Method for Continuous Influence Maximization with Budget-Saving Considerations

Published in AAAI Conference on Artificial Intelligence, 2019

Read more

Download here

Online Second Price Auction with Semi-bandit Feedback Under the Non-Stationary Setting

Published in AAAI Conference on Artificial Intelligence, 2019

Read more

Download here

Mildly Overparametrized Neural Nets can Memorize Training Data Efficiently

Published in , 2019

Read more

Download here

Combinatorial Pure Exploration of Dueling Bandit

Published in International Conference on Machine Learning, 2020

Read more

Download here

Combinatorial Semi-Bandit in the Non-Stationary Environment

Published in The Conference on Uncertainty in Artificial Intelligence, 2021

Read more

Download here

FedPAGE: A Fast Local Stochastic Gradient Method for Communication-Efficient Federated Learning

Published in , 2021

Read more

Download here

BEER: Fast O(1/T) Rate for Decentralized Nonconvex Optimization with Communication Compression

Published in Conference on Neural Information Processing Systems, 2022

Read more

Download here

SoteriaFL: A Unified Framework for Private Federated Learning with Communication Compression

Published in Conference on Neural Information Processing Systems, 2022

Read more

Download here

Coresets for Vertical Federated Learning: Regularized Linear Regression and K-Means Clustering

Published in Conference on Neural Information Processing Systems, 2022

Read more

Download here

Task-Specific Skill Localization in Fine-tuned Language Models

Published in International Conference on Machine Learning, 2023

Read more

Download here

Faster Rates for Compressed Federated Learning with Client-Variance Reduction

Published in SIAM Journal on Mathematics of Data Science, 2023

Read more

Download here

Do Transformers Parse while Predicting the Masked Word?

Published in Conference on Empirical Methods in Natural Language Processing, 2023

Read more

Download here

Adversarial Attacks on Combinatorial Multi-Armed Bandits

Published in International Conference on Machine Learning, 2024

Read more

Download here

Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates

Published in Conference on Neural Information Processing Systems, 2024

Read more

Download here

Can Models Learn Skill Composition from Examples?

Published in Conference on Neural Information Processing Systems, 2024

Read more

Download here

Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set

Published in , 2025

Read more

Download here

Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities

Published in , 2025

Read more

Download here

talks

Oral presentation at ECML/PKDD 2019

Published: September 19, 2019

In this talk, I presented my work with Prof. Wei Chen @MSRA on our paper Stochastic One-Sided Full-Information Bandit. The paper can be downloaded here. Read more

Oral presentation at AAAI 2020

Published: February 11, 2020

In this talk, I presented my work with Prof. Wei Chen @MSRA on our paper Online Second Price Auction with Semi-bandit Feedback Under the Non-Stationary Setting. Because of the virus in China, I cannot go the the AAAI main conference, and I will give my oral presentation remotely. The paper can be downloaded here. The PPT is available at here. Read more

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Heading 1

Heading 2

Heading 3

Read more

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Heading 1

Heading 2

Heading 3

Read more

Haoyu Zhao

Sitemap

Pages

Posts

portfolio

publications

talks

teaching

Heading 1

Heading 2

Heading 3

Heading 1

Heading 2

Heading 3