[20230903] Weekly AI ArXiv 만담 시즌2 - 24회차

### News
- HyperCLOVA X 공개 (8.24)
  - 네이버클라우드 소개페이지: https://www.ncloud.com/solution/featured/hyperclovax
  - DAN23 영상 다시보기: https://tv.naver.com/v/39568301
- [ChatGPT-3.5 Tuning and Enterprise](https://openai.com/blog/gpt-3-5-turbo-fine-tuning-and-api-updates)
- [Google Cloud Next 2023](https://cloud.google.com/blog/ko/topics/google-cloud-next/welcome-to-google-cloud-next-23)
  - TPUv5e
  - 듀엣AI, Vertex AI -- LLM은 B2B로
- [메타, 유럽서 페북·인스타 ‘유료버전’ 검토…EU 규제 영향](https://n.news.naver.com/mnews/article/018/0005565885?sid=105)

### ArXiv
- [DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining](https://arxiv.org/abs/2305.10429)
  - 구글에서 나온 Pretraining 시 corpus 도메인 최적화 하는 방법 연구 (평가가 아주 좋음)
  -  Small referece model 로 small proxy model 만들고 domain weight 최적화 해서 pretrainin corpus 구성
  - 주로 280M을 레퍼런스 모델로 해서 8B에 올려봤는데 FT에서 효과가 아주 좋음
  - GLaM, Pile 데이터셋을 통해 성능평가. 레퍼런스 모델크기에 대한 다양한 ablation
  - Pretraining 을 수행하고자 하는 연구그룹에서는 꼭 참조해 보면 좋을 연구
 ![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/2a930c2e-8c1d-4a8e-af37-661f5b471902)
 ![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/41727b10-8d17-4731-8e81-76f573d75dd4)
![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/7f6c07aa-e4cd-4bc0-a886-27c598f97716)
![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/f307da06-544a-4f75-a73c-54f3e155074b)

- [The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants](https://arxiv.org/abs/2308.16884)
  - Meta에서 만든 122개 언어를 커버하는 Multi-choice MRC 데이터셋
  - 기반은 다국어 번역 벤치마크인 FLORES-200의 passage들을 기반으로 함
  - 이를 Human - AI collaboration 을 통해 MRC 셋으로 만들어 공개
  - 언어종류도 High, mid, low resource 즉 주류 중간 비주류 언어 모두를 커버하도록
  - 평가는 MLM 모델 (InfoXLM, XLM-V, 번역후 학습), LLM (GPT-3.5-Turbo, LLaMA1,2, Falcon-40B, Zero-shot)
  - Low resource 언어는 모델 커져도 별로 재미를 못보는 듯..
![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/a30a12ed-493a-4890-80b7-b1776194a1b6)
![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/529feaca-61f8-4cda-9bad-e93082fe7e7b)
![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/306d02b0-fcda-4d73-8169-d44108514655)
![image](https://github.com/jungwoo-ha/WeeklyArxivTalk/assets/11782739/2aafaca3-4010-4e84-a576-0c0147c9c83c)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[20230903] Weekly AI ArXiv 만담 시즌2 - 24회차 #90

News

ArXiv

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

[20230903] Weekly AI ArXiv 만담 시즌2 - 24회차 #90

Description

News

ArXiv

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions