Artificial Intelligence and Robotics | Open Access Articles

BERTastic At SemEval-2023 Task 3: Fine-Tuning Pretrained Multilingual Transformers – Does Order Matter?, Tarek Mahmoud, Preslav Nakov

Natural Language Processing Faculty Publications

The naïve approach to fine-tuning pretrained deep learning models on downstream tasks involves feeding them mini-batches of randomly sampled data. In this paper, we propose a more elaborate method for fine-tuning Pretrained Multilingual Transformers (PMTs) on multilingual data. Inspired by the success of curriculum learning approaches, we investigate the significance of fine-tuning PMTs on multilingual data sequentially, language by language. Unlike the curriculum learning paradigm, where the model is presented with increasingly complex examples, we do not adopt a notion of “easy” and “hard” samples. Instead, our experiments draw insight from psychological findings on how the human …
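
The contrast the abstract draws, randomly sampled mini-batches versus presenting the data language by language, can be illustrated with a minimal sketch. This is not the authors' implementation: the excerpt does not give their language ordering or training loop, so the function names, the `lang` and `text` fields, and the toy language codes below are all hypothetical.

```python
import random
from collections import defaultdict


def random_batches(examples, batch_size, seed=0):
    """Baseline: shuffle all languages together, then cut into mini-batches."""
    rng = random.Random(seed)
    pool = list(examples)
    rng.shuffle(pool)
    return [pool[i:i + batch_size] for i in range(0, len(pool), batch_size)]


def sequential_batches(examples, language_order, batch_size, seed=0):
    """Sequential variant: exhaust one language before moving to the next.

    `language_order` is an assumed input; how to choose it is exactly the
    question the paper studies and is not specified here.
    """
    rng = random.Random(seed)
    by_lang = defaultdict(list)
    for ex in examples:
        by_lang[ex["lang"]].append(ex)

    batches = []
    for lang in language_order:
        pool = list(by_lang[lang])
        rng.shuffle(pool)  # shuffle within a language only
        for i in range(0, len(pool), batch_size):
            batches.append(pool[i:i + batch_size])
    return batches


# Toy data with hypothetical language codes.
examples = [
    {"lang": lang, "text": f"{lang}-{i}"}
    for lang in ("en", "fr", "de")
    for i in range(5)
]

mixed = random_batches(examples, batch_size=2)
ordered = sequential_batches(examples, ["de", "en", "fr"], batch_size=2)

for batch in ordered:
    print([ex["text"] for ex in batch])
```

In a real fine-tuning run, each batch would be passed to the model's training step in the order produced; only the order of presentation differs between the two functions.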

Full-Text Articles in Artificial Intelligence and Robotics

Safe MDP Planning By Learning Temporal Patterns Of Undesirable Trajectories And Averting Negative Side Effects, Siow Meng Low, Akshat Kumar, Scott Sanner

Research Collection School Of Computing and Information Systems

Stability-Based Generalization Analysis For Mixtures Of Pointwise And Pairwise Learning, Jiahuan Wang, Jun Chen, Hong Chen, Bin Gu, Weifu Li, Xin Tang

Machine Learning Faculty Publications

Truncated Matrix Power Iteration For Differentiable DAG Learning, Zhen Zhang, Ignavier Ng, Dong Gong, Yuhang Liu, Ehsan M. Abbasnejad, Mingming Gong, Kun Zhang, Javen Qinfeng Shi

Machine Learning Faculty Publications

Action-Sufficient State Representation Learning For Control With Structural Constraints, Biwei Huang, Chaochao Lu, Liu Leqi, José Miguel Hernández-Lobato, Clark Glymour, Bernhard Schölkopf, Kun Zhang

Machine Learning Faculty Publications

Learning To Generalize Dispatching Rules On The Job Shop Scheduling, Zangir Iklassov, Dmitrii Medvedev, Ruben Solozabal, Martin Takac

Machine Learning Faculty Publications

FLECS: A Federated Learning Second-Order Framework Via Compression And Sketching, Artem Agafonov, Dmitry Kamzolov, Rachael Tappenden, Alexander Gasnikov, Martin Takac

Machine Learning Faculty Publications

Offline Reinforcement Learning With Causal Structured World Models, Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu

Machine Learning Faculty Publications

Deep Learning For Anomaly Detection, Guansong Pang, Charu Aggarwal, Chunhua Shen, Nicu Sebe

Research Collection School Of Computing and Information Systems

Self-Supervised Video Object Segmentation Via Cutout Prediction And Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, Mubarak Shah

Computer Vision Faculty Publications

HSVA: Hierarchical Semantic-Visual Adaptation For Zero-Shot Learning, Shiming Chen, Guo Sen Xie, Yang Liu, Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Machine Learning Faculty Publications

Learning From Mistakes - A Framework For Neural Architecture Search, Bhanu Garg, Li Zhang, Pradyumna Sridhara, Ramtin Hosseini, Eric P. Xing, Pengtao Xie

Machine Learning Faculty Publications

Orthogonal Inductive Matrix Completion, Antoine Ledent, Rodrigo Alves, Marius Kloft

Research Collection School Of Computing and Information Systems

Learning To Fuse Asymmetric Feature Maps In Siamese Trackers, Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

Computer Vision Faculty Publications

Recurrent Neural Networks With Auxiliary Labels For Cross-Domain Opinion Target Extraction, Ying Ding, Jianfei Yu, Jing Jiang

Research Collection School Of Computing and Information Systems

Message Passing For Collective Graphical Models, Tao Sun, Daniel Sheldon, Akshat Kumar

Research Collection School Of Computing and Information Systems

Motivated Learning As An Extension Of Reinforcement Learning, Janusz Starzyk, Pawel Raif, Ah-Hwee Tan

Research Collection School Of Computing and Information Systems