Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MULTI-TASK VISION TRANSFORMER DEVICE BASED ON DISTRIBUTED LEARNING USING RANDOM PATCH PERMUTATION, AND TRANSFORMATION METHOD USING SAME
Document Type and Number:
WIPO Patent Application WO/2024/063442
Kind Code:
A1
Abstract:
Disclosed are a multi-task vision transformer device based on distributed learning using random patch permutation, and a transformation method using same. The transformation method using a multi-task vision transformer device based on distributed learning using random patch permutation, according to one embodiment, may comprise the steps of: using a task non-specific patch embedder so as to prepare patch embedding for each client, passing same through a permutation module, and then transmitting same to a server; and storing the patch embedding received at the server, and using same in order to update a body and a tail portion of a vision transformer model.

Inventors:
YE JONGCHUL (KR)
PARK SANGJOON (KR)
Application Number:
PCT/KR2023/013839
Publication Date:
March 28, 2024
Filing Date:
September 14, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
KOREA ADVANCED INST SCI & TECH (KR)
International Classes:
G06N3/045; G06N3/042; G06N3/08; G06N20/20
Other References:
PARK SANGJOON, YE JONG CHUL: "Multi-Task Distributed Learning Using Vision Transformer With Random Patch Permutation", IEEE TRANSACTIONS ON MEDICAL IMAGING, IEEE, USA, vol. 42, no. 7, 7 April 2022 (2022-04-07), USA, pages 2091 - 2105, XP093149959, ISSN: 0278-0062, DOI: 10.1109/TMI.2022.3218783
KAI HAN; YUNHE WANG; HANTING CHEN; XINGHAO CHEN; JIANYUAN GUO; ZHENHUA LIU; YEHUI TANG; AN XIAO; CHUNJING XU; YIXING XU; ZHAOHUI Y: "A Survey on Vision Transformer", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 23 February 2022 (2022-02-23), 201 Olin Library Cornell University Ithaca, NY 14853, XP091145840, DOI: 10.1109/TPAMI.2022.3152247
GONG XUAN; SHARMA ABHISHEK; KARANAM SRIKRISHNA; WU ZIYAN; CHEN TERRENCE; DOERMANN DAVID; INNANJE ARUN: "Ensemble Attention Distillation for Privacy-Preserving Federated Learning", 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), IEEE, 10 October 2021 (2021-10-10), pages 15056 - 15066, XP034093042, DOI: 10.1109/ICCV48922.2021.01480
ALEDHARI MOHAMMED; RAZZAK REHMA; PARIZI REZA M.; SAEED FAHAD: "Federated Learning: A Survey on Enabling Technologies, Protocols, and Applications", IEEE ACCESS, IEEE, USA, vol. 8, 30 July 2020 (2020-07-30), USA , pages 140699 - 140725, XP011802183, DOI: 10.1109/ACCESS.2020.3013541
GUO SHANGWEI, ZHANG XU, YANG FEI, ZHANG TIANWEI, GAN YAN, XIANG TAO, LIU YANG: "Robust and Privacy-Preserving Collaborative Learning: A Comprehensive Survey", ARXIV (CORNELL UNIVERSITY), CORNELL UNIVERSITY LIBRARY, ARXIV.ORG, ITHACA, 19 December 2021 (2021-12-19), Ithaca, pages 1 - 19, XP093149961, Retrieved from the Internet [retrieved on 20240410], DOI: 10.48550/arxiv.2112.10183
Attorney, Agent or Firm:
KIM, Jeong Hoon (KR)
Download PDF: