ML + Vision Top-6 Agent Survey - ICML 2023 - Page 2 of 2

  • Venue: International Conference on Machine Learning
  • Year: 2023
  • Page: 2 / 2
  • Papers: 31-35 / 35
diff History for Neural Language Agents Paper
  • Authors: Ulyana Piterbarg, Lerrel Pinto, Rob Fergus
  • Year: 2023
  • Venue: International Conference on Machine Learning
  • DOI: Not stated.
  • Citations: 4
  • Relevance: 3 / 5
  • Why selected: Heuristic keyword/alias matches: LLM agents (matched: lm agents, language agents).
  • Code: Not found.
  • Extraction: method/data pending

Abstract

Neural Language Models (LMs) offer an exciting solution for general-purpose embodied control. However, a key technical issue arises when using an LM-based controller: environment observations must be converted to text, which coupled with history, results in long and verbose textual prompts. As a result, prior work in LM agents is limited to restricted domains with small observation size as well as minimal needs for interaction history or instruction tuning. In this paper, we introduce diff history, a simple and highly effective solution to these issues. By applying the Unix diff command on consecutive text observations in the interaction histories used to prompt LM policies, we can both abstract away redundant information and focus the content of textual inputs on the salient changes in the environment. On NetHack, an unsolved video game that requires long-horizon reasoning for decision-making, LMs tuned with diff history match state-of-the-art performance for neural agents while needing 1800x fewer training examples compared to prior work. Even on the simpler BabyAI-Text environment with concise text observations, we find that although diff history increases the length of prompts, the representation it provides offers a 25% improvement in the efficiency of low-sample instruction tuning. Further, we show that diff history scales favorably across different tuning dataset sizes. We open-source our code and data to https://diffhistory.github.io.

Claim

Neural Language Models (LMs) offer an exciting solution for general-purpose embodied control.

Divide and Conquer Dynamic Programming: An Almost Linear Time Change Point Detection Methodology in High Dimensions Paper
  • Authors: Wanshan Li, Daren Wang, A. Rinaldo
  • Year: 2023
  • Venue: International Conference on Machine Learning
  • DOI: Not stated.
  • Citations: 3
  • Relevance: 3 / 5
  • Why selected: Heuristic keyword/alias matches: program synthesis (matched: programming).
  • Code: Not found.
  • Extraction: method/data pending

Abstract

We develop a novel, general and computationally efficient framework, called Divide and Conquer Dynamic Programming (DCDP), for localizing change points in time series data with high-dimensional features. DCDP deploys a class of greedy algorithms that are applicable to a broad variety of high-dimensional statistical models and can enjoy almost linear computational complexity. We investigate the performance of DCDP in three commonly studied change point settings in high dimensions: the mean model, the Gaussian graphical model, and the linear regression model. In all three cases, we derive non-asymptotic bounds for the accuracy of the DCDP change point estimators. We demonstrate that the DCDP procedures consistently estimate the change points with sharp, and in some cases, optimal rates while incurring significantly smaller computational costs than the best available algorithms. Our findings are supported by extensive numerical experiments on both synthetic and real data.

Claim

We develop a novel, general and computationally efficient framework, called Divide and Conquer Dynamic Programming (DCDP), for localizing change points in time series data with high-dimensional features.

Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts Paper
  • Authors: Dirk van der Hoeven, Ciara Pike-Burke, Haotian Qiu, Nicolò Cesa-Bianchi
  • Year: 2023
  • Venue: International Conference on Machine Learning
  • DOI: 10.48550/arXiv.2307.00836
  • Citations: 2
  • Relevance: 3 / 5
  • Why selected: Heuristic keyword/alias matches: alpha factor search (matched: trading).
  • Code: Not found.
  • Extraction: method/data pending

Abstract

We investigate online classification with paid stochastic experts. Here, before making their prediction, each expert must be paid. The amount that we pay each expert directly influences the accuracy of their prediction through some unknown Lipschitz"productivity"function. In each round, the learner must decide how much to pay each expert and then make a prediction. They incur a cost equal to a weighted sum of the prediction error and upfront payments for all experts. We introduce an online learning algorithm whose total cost after \(T\) rounds exceeds that of a predictor which knows the productivity of all experts in advance by at most \(\mathcal{O}(K^2(\log T)\sqrt{T})\) where \(K\) is the number of experts. In order to achieve this result, we combine Lipschitz bandits and online classification with surrogate losses. These tools allow us to improve upon the bound of order \(T^{2/3}\) one would obtain in the standard Lipschitz bandit setting. Our algorithm is empirically evaluated on synthetic data

Claim

We investigate online classification with paid stochastic experts.

Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming Paper
  • Authors: Jinuk Kim, Yeonwoo Jeong, Deokjae Lee, Hyun Oh Song
  • Year: 2023
  • Venue: International Conference on Machine Learning
  • DOI: 10.48550/arXiv.2301.12187
  • Citations: 1
  • Relevance: 3 / 5
  • Why selected: Heuristic keyword/alias matches: program synthesis (matched: programming).
  • Code: Not found.
  • Extraction: method/data pending

Abstract

Recent works on neural network pruning advocate that reducing the depth of the network is more effective in reducing run-time memory usage and accelerating inference latency than reducing the width of the network through channel pruning. In this regard, some recent works propose depth compression algorithms that merge convolution layers. However, the existing algorithms have a constricted search space and rely on human-engineered heuristics. In this paper, we propose a novel depth compression algorithm which targets general convolution operations. We propose a subset selection problem that replaces inefficient activation layers with identity functions and optimally merges consecutive convolution operations into shallow equivalent convolution operations for efficient end-to-end inference latency. Since the proposed subset selection problem is NP-hard, we formulate a surrogate optimization problem that can be solved exactly via two-stage dynamic programming within a few seconds. We evaluate our methods and baselines by TensorRT for a fair inference latency comparison. Our method outperforms the baseline method with higher accuracy and faster inference speed in MobileNetV2 on the ImageNet dataset. Specifically, we achieve \(1.41\times\) speed-up with \(0.11\)%p accuracy gain in MobileNetV2-1.0 on the ImageNet.

Claim

Recent works on neural network pruning advocate that reducing the depth of the network is more effective in reducing run-time memory usage and accelerating inference latency than reducing the width of the network through channel pruning.

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming Paper
  • Authors: Xinlei Niu, Christian J. Walder, J. Zhang, Charles Patrick Martin
  • Year: 2023
  • Venue: International Conference on Machine Learning
  • DOI: 10.48550/arXiv.2306.02568
  • Citations: 0
  • Relevance: 3 / 5
  • Why selected: Heuristic keyword/alias matches: program synthesis (matched: programming).
  • Code: Not found.
  • Extraction: method/data pending

Abstract

We propose the stochastic optimal path which solves the classical optimal path problem by a probability-softening solution. This unified approach transforms a wide range of DP problems into directed acyclic graphs in which all paths follow a Gibbs distribution. We show the equivalence of the Gibbs distribution to a message-passing algorithm by the properties of the Gumbel distribution and give all the ingredients required for variational Bayesian inference of a latent path, namely Bayesian dynamic programming (BDP). We demonstrate the usage of BDP in the latent space of variational autoencoders (VAEs) and propose the BDP-VAE which captures structured sparse optimal paths as latent variables. This enables end-to-end training for generative tasks in which models rely on unobserved structural information. At last, we validate the behavior of our approach and showcase its applicability in two real-world applications: text-to-speech and singing voice synthesis. Our implementation code is available at \url{https://github.com/XinleiNIU/LatentOptimalPathsBayesianDP}.

Claim

We propose the stochastic optimal path which solves the classical optimal path problem by a probability-softening solution.