Login:

Friday, November 15, 2024

Today's top trending papers in Computer Science

681,908 papers ranked by PageRank*. +367 new papers added in the last 11 hours. Read more.

Filters:

Sort:

All categories
  • All categories
  • cs
    Computer Vision and Pattern Recognition
    90
  • cs
    Machine Learning
    58
  • cs
    Computation and Language
    40
  • cs
    Information Theory
    18
  • cs
    Robotics
    32
  • cs
    Cryptography and Security
    13
  • cs
    Artificial Intelligence
    24
  • eess
    Systems and Control
    17
  • math
    Numerical Analysis
    25
  • cs
    Networking and Internet Architecture
    8
  • cs
    Data Structures and Algorithms
    8
  • stat
    Machine Learning
    11
  • cs
    Software Engineering
    15
  • cs
    Distributed, Parallel, and Cluster Computing
    3
  • eess
    Image and Video Processing
    19
  • cs
    Human-Computer Interaction
    6
  • cs
    Computers and Society
    22
  • cs
    Information Retrieval
    12
  • cs
    Social and Information Networks
    5
  • cs
    Logic in Computer Science
    3
  • math
    Optimization and Control
    6
  • cs
    Computer Science and Game Theory
    6
  • cs
    Sound
    6
  • quant-ph
    Quantum Physics
    10
  • cs
    Neural and Evolutionary Computing
    1
  • eess
    Signal Processing
    5
  • cs
    Databases
  • eess
    Audio and Speech Processing
    2
  • math
    Combinatorics
    1
  • cs
    Computational Complexity
    3
  • cs
    Programming Languages
    4
  • cs
    Discrete Mathematics
    5
  • physics
    Physics and Society
  • cs
    Computational Geometry
  • cs
    Computational Engineering, Finance, and Science
    1
  • cs
    Digital Libraries
    1
  • cs
    Hardware Architecture
    5
  • cs
    Formal Languages and Automata Theory
  • cs
    Multiagent Systems
    2
  • cs
    Graphics
  • cs
    Multimedia
    2
  • cs
    Emerging Technologies
  • cs
    Other Computer Science
  • math
    Probability
    1
  • q-bio
    Neurons and Cognition
    1
  • q-bio
    Quantitative Methods
    1
  • math
    Statistics Theory
  • stat
    Methodology
    1
  • cs
    Symbolic Computation
  • cs
    Performance
  • physics
    Computational Physics
    1
  • cs
    Mathematical Software
  • stat
    Applications
  • astro-ph
    Instrumentation and Methods for Astrophysics
  • math
    Logic
  • physics
    Fluid Dynamics
    2
  • math
    Dynamical Systems
    1
  • cond-mat
    Materials Science
    1
  • math
    Analysis of PDEs
    1
  • q-bio
    Biomolecules
    1
  • physics
    Medical Physics
    1
  • math
    Number Theory
  • q-bio
    Populations and Evolution
  • cond-mat
    Statistical Mechanics
  • cs
    Operating Systems
  • stat
    Computation
  • q-bio
    Genomics
    1
  • q-fin
    Statistical Finance
  • cond-mat
    Disordered Systems and Neural Networks
  • math
    Algebraic Geometry
  • physics
    Atmospheric and Oceanic Physics
    1
  • physics
    Chemical Physics
  • physics
    Data Analysis, Statistics and Probability
  • physics
    Optics
    2
  • econ
    General Economics
  • econ
    Theoretical Economics
    1
  • physics
    Geophysics
    1
  • math
    Functional Analysis
  • math
    Group Theory
  • math
    Algebraic Topology
  • math
    Category Theory
    1
  • physics
    Applied Physics
    1
  • q-fin
    Computational Finance
  • math-ph
    Mathematical Physics
    1
  • math
    Metric Geometry
  • q-bio
    Molecular Networks
  • nlin
    Adaptation and Self-Organizing Systems
  • physics
    Instrumentation and Detectors
  • q-fin
    Trading and Market Microstructure
    1
  • astro-ph
    Earth and Planetary Astrophysics
    1
  • hep-ph
    Phenomenology
    1
  • math
    Classical Analysis and ODEs
  • math
    Rings and Algebras
  • econ
    Econometrics
  • q-fin
    General Finance
  • q-fin
    Portfolio Management
  • astro-ph
    Cosmology and Nongalactic Astrophysics
  • astro-ph
    Solar and Stellar Astrophysics
  • math
    Commutative Algebra
  • math
    History and Overview
  • hep-ex
    Experiment
  • cond-mat
    Soft Condensed Matter
  • physics
    Plasma Physics
  • math
    Differential Geometry
  • nlin
    Chaotic Dynamics
  • cond-mat
    Mesoscale and Nanoscale Physics
  • math
    Geometric Topology
  • physics
    Biological Physics
  • q-fin
    Risk Management
  • hep-lat
    Lattice
  • physics
    Physics Education
  • hep-th
    Theory
  • gr-qc
    General Relativity and Quantum Cosmology
  • nlin
    Cellular Automata and Lattice Gases
  • cs
    General Literature
  • astro-ph
    Astrophysics of Galaxies
  • q-bio
    Other Quantitative Biology
  • q-bio
    Tissues and Organs
  • math
    Spectral Theory
  • stat
    Other Statistics
  • physics
    Accelerator Physics
  • nlin
    Pattern Formation and Solitons
  • astro-ph
    High Energy Astrophysical Phenomena
  • cond-mat
    Strongly Correlated Electrons
  • physics
    Space Physics
  • physics
    Classical Physics
  • q-fin
    Mathematical Finance
  • math
    General Topology
  • math
    Representation Theory
  • math
    Operator Algebras
  • q-fin
    Pricing of Securities
  • nlin
    Exactly Solvable and Integrable Systems
  • nucl-th
    Nuclear Theory
  • physics
    History and Philosophy of Physics
  • cond-mat
    Superconductivity
  • math
    Complex Variables
  • physics
    Popular Physics
  • q-bio
    Cell Behavior
  • Astrophysics
  • math
    General Mathematics
  • cond-mat
    Quantum Gases
  • physics
    General Physics
  • cond-mat
    Other Condensed Matter
  • q-bio
    Subcellular Processes
  • math
    Quantum Algebra
  • physics
    Atomic and Molecular Clusters
  • physics
    Atomic Physics
  • math
    Symplectic Geometry
  • math
    K-Theory and Homology
  • nucl-ex
    Nuclear Experiment

Mistral 7B

PageRank: 1,250
Growth: +6,584%
Citations: 3,290

Jiang, Albert Q. | Sablayrolles, Alexandre | Mensch, Arthur | Bamford, Chris | Chaplot, Devendra Singh | Casas, Diego de las | Bressand, Florian | Lengyel, Gianna | Lample, Guillaume | Saulnier, Lucile | Lavaud, Lélio Renard | Lachaux, Marie-Anne | Stock, Pierre | Scao, Teven Le | Lavril, Thibaut | Wang, Thomas | Lacroix, Timothée | Sayed, William El

Oct 10, 2023 – Mistral 7B is a new language model with 7 billion parameters that outperforms other models in various benchmarks, including reasoning, mathematics, and code generation. It utilizes grouped-query attention and sliding window attention for faster inference and reduced cost, and there is also a fine-tuned version called Mistral 7B -- Instruct that performs better than other models in following instructions.

Gemini: A Family of Highly Capable Multimodal Models

PageRank: 2,340
Growth: +3,386%
Citations: 2,074

Gemini Team | Anil, Rohan | Borgeaud, Sebastian | Alayrac, Jean-Baptiste | Yu, Jiahui | Soricut, Radu | Schalkwyk, Johan | Dai, Andrew M. | Hauth, Anja | Millican, Katie | Silver, David | Johnson, Melvin | Antonoglou, Ioannis | Schrittwieser, Julian | Glaese, Amelia | Chen, Jilin | Pitler, Emily | Lillicrap, Timothy | Lazaridou, Angeliki | Firat, Orhan | Molloy, James | Isard, Michael | Barham, Paul R. | Hennigan, Tom | Lee, Benjamin | Viola, Fabio | Reynolds, Malcolm | Xu, Yuanzhong | Doherty, Ryan | Collins, Eli | Meyer, Clemens | Rutherford, Eliza | Moreira, Erica | Ayoub, Kareem | Goel, Megha | Krawczyk, Jack | Du, Cosmo | Chi, Ed | Cheng, Heng-Tze | Ni, Eric | Shah, Purvi | Kane, Patrick | Chan, Betty | Faruqui, Manaal | Severyn, Aliaksei | Lin, Hanzhao | Li, YaGuang | Cheng, Yong | Ittycheriah, Abe | Mahdieh, Mahdis | Chen, Mia | Sun, Pei | Tran, Dustin | Bagri, Sumit | Lakshminarayanan, Balaji | Liu, Jeremiah | Orban, Andras | Güra, Fabian | Zhou, Hao | Song, Xinying | Boffy, Aurelien | Ganapathy, Harish | Zheng, Steven | Choe, HyunJeong | Weisz, Ágoston | Zhu, Tao | Lu, Yifeng | Gopal, Siddharth | Kahn, Jarrod | Kula, Maciej | Pitman, Jeff | Shah, Rushin | Taropa, Emanuel | Merey, Majd Al | Baeuml, Martin | Chen, Zhifeng | Shafey, Laurent El | Zhang, Yujing | Sercinoglu, Olcan | Tucker, George | Piqueras, Enrique | Krikun, Maxim | Barr, Iain | Savinov, Nikolay | Danihelka, Ivo | Roelofs, Becca | White, Anaïs | Andreassen, Anders | von Glehn, Tamara | Yagati, Lakshman | Kazemi, Mehran | Gonzalez, Lucas | Khalman, Misha | Sygnowski, Jakub | Frechette, Alexandre | Smith, Charlotte | Culp, Laura | Proleev, Lev | Luan, Yi | Chen, Xi | Lottes, James | Schucher, Nathan | Lebron, Federico | Rrustemi, Alban | Clay, Natalie | Crone, Phil | Kocisky, Tomas | Zhao, Jeffrey | Perz, Bartek | Yu, Dian | Howard, Heidi | Bloniarz, Adam | Rae, Jack W. | Lu, Han | Sifre, Laurent | Maggioni, Marcello | Alcober, Fred | Garrette, Dan | Barnes, Megan | Thakoor, Shantanu | Austin, Jacob | Barth-Maron, Gabriel | Wong, William | Joshi, Rishabh | Chaabouni, Rahma | Fatiha, Deeni | Ahuja, Arun | Tomar, Gaurav Singh | Senter, Evan | Chadwick, Martin | Kornakov, Ilya | Attaluri, Nithya | Iturrate, Iñaki | Liu, Ruibo | Li, Yunxuan | Cogan, Sarah | Chen, Jeremy | Jia, Chao | Gu, Chenjie | Zhang, Qiao | Grimstad, Jordan | Hartman, Ale Jakse | Garcia, Xavier | Pillai, Thanumalayan Sankaranarayana | Devlin, Jacob | Laskin, Michael | Casas, Diego de Las | Valter, Dasha | Tao, Connie | Blanco, Lorenzo | Badia, Adrià Puigdomènech | Reitter, David | Chen, Mianna | Brennan, Jenny | Rivera, Clara | Brin, Sergey | Iqbal, Shariq | Surita, Gabriela | Labanowski, Jane | Rao, Abhi | Winkler, Stephanie | Parisotto, Emilio | Gu, Yiming | Olszewska, Kate | Addanki, Ravi | Miech, Antoine | Louis, Annie | Teplyashin, Denis | Brown, Geoff | Catt, Elliot | Balaguer, Jan | Xiang, Jackie | Wang, Pidong | Ashwood, Zoe | Briukhov, Anton | Webson, Albert | Ganapathy, Sanjay | Sanghavi, Smit | Kannan, Ajay | Chang, Ming-Wei | Stjerngren, Axel | Djolonga, Josip | Sun, Yuting | Bapna, Ankur | Aitchison, Matthew | Pejman, Pedram | Michalewski, Henryk | Yu, Tianhe | Wang, Cindy | Love, Juliette | Ahn, Junwhan | Bloxwich, Dawn | Han, Kehang | Humphreys, Peter | Sellam, Thibault | Bradbury, James | Godbole, Varun | Samangooei, Sina | Damoc, Bogdan | Kaskasoli, Alex | Arnold, Sébastien M. R. | Vasudevan, Vijay | Agrawal, Shubham | Riesa, Jason | Lepikhin, Dmitry | Tanburn, Richard | Srinivasan, Srivatsan | Lim, Hyeontaek | Hodkinson, Sarah | Shyam, Pranav | Ferret, Johan | Hand, Steven | Garg, Ankush | Paine, Tom Le | Li, Jian | Li, Yujia | Giang, Minh | Neitz, Alexander | Abbas, Zaheer | York, Sarah | Reid, Machel | Cole, Elizabeth | Chowdhery, Aakanksha | Das, Dipanjan | Rogozińska, Dominika | Nikolaev, Vitaliy | Sprechmann, Pablo | Nado, Zachary | Zilka, Lukas | Prost, Flavien | He, Luheng | Monteiro, Marianne | Mishra, Gaurav | Welty, Chris | Newlan, Josh | Jia, Dawei | Allamanis, Miltiadis | Hu, Clara Huiyi | de Liedekerke, Raoul | Gilmer, Justin | Saroufim, Carl | Rijhwani, Shruti | Hou, Shaobo | Shrivastava, Disha | Baddepudi, Anirudh | Goldin, Alex | Ozturel, Adnan | Cassirer, Albin | Xu, Yunhan | Sohn, Daniel | Sachan, Devendra | Amplayo, Reinald Kim | Swanson, Craig | Petrova, Dessie | Narayan, Shashi | Guez, Arthur | Brahma, Siddhartha | Landon, Jessica | Patel, Miteyan | Zhao, Ruizhe | Villela, Kevin | Wang, Luyu | Jia, Wenhao | Rahtz, Matthew | Giménez, Mai | Yeung, Legg | Keeling, James | Georgiev, Petko | Mincu, Diana | Wu, Boxi | Haykal, Salem | Saputro, Rachel | Vodrahalli, Kiran | Qin, James | Cankara, Zeynep | Sharma, Abhanshu | Fernando, Nick | Hawkins, Will | Neyshabur, Behnam | Kim, Solomon | Hutter, Adrian | Agrawal, Priyanka | Castro-Ros, Alex | Driessche, George van den | Wang, Tao | Yang, Fan | Chang, Shuo-yiin | Komarek, Paul | McIlroy, Ross | Lučić, Mario | Zhang, Guodong | Farhan, Wael | Sharman, Michael | Natsev, Paul | Michel, Paul | Bansal, Yamini | Qiao, Siyuan | Cao, Kris | Shakeri, Siamak | Butterfield, Christina | Chung, Justin | Rubenstein, Paul Kishan | Agrawal, Shivani | Mensch, Arthur | Soparkar, Kedar | Lenc, Karel | Chung, Timothy | Pope, Aedan | Maggiore, Loren | Kay, Jackie | Jhakra, Priya | Wang, Shibo | Maynez, Joshua | Phuong, Mary | Tobin, Taylor | Tacchetti, Andrea | Trebacz, Maja | Robinson, Kevin | Katariya, Yash | Riedel, Sebastian | Bailey, Paige | Xiao, Kefan | Ghelani, Nimesh | Aroyo, Lora | Slone, Ambrose | Houlsby, Neil | Xiong, Xuehan | Yang, Zhen | Gribovskaya, Elena | Adler, Jonas | Wirth, Mateo | Lee, Lisa | Li, Music | Kagohara, Thais | Pavagadhi, Jay | Bridgers, Sophie | Bortsova, Anna | Ghemawat, Sanjay | Ahmed, Zafarali | Liu, Tianqi | Powell, Richard | Bolina, Vijay | Iinuma, Mariko | Zablotskaia, Polina | Besley, James | Chung, Da-Woon | Dozat, Timothy | Comanescu, Ramona | Si, Xiance | Greer, Jeremy | Su, Guolong | Polacek, Martin | Kaufman, Raphaël Lopez | Tokumine, Simon | Hu, Hexiang | Buchatskaya, Elena | Miao, Yingjie | Elhawaty, Mohamed | Siddhant, Aditya | Tomasev, Nenad | Xing, Jinwei | Greer, Christina | Miller, Helen | Ashraf, Shereen | Roy, Aurko | Zhang, Zizhao | Ma, Ada | Filos, Angelos | Besta, Milos | Blevins, Rory | Klimenko, Ted | Yeh, Chih-Kuan | Changpinyo, Soravit | Mu, Jiaqi | Chang, Oscar | Pajarskas, Mantas | Muir, Carrie | Cohen, Vered | Lan, Charline Le | Haridasan, Krishna | Marathe, Amit | Hansen, Steven | Douglas, Sholto | Samuel, Rajkumar | Wang, Mingqiu | Austin, Sophia | Lan, Chang | Jiang, Jiepu | Chiu, Justin | Lorenzo, Jaime Alonso | Sjösund, Lars Lowe | Cevey, Sébastien | Gleicher, Zach | Avrahami, Thi | Boral, Anudhyan | Srinivasan, Hansa | Selo, Vittorio | May, Rhys | Aisopos, Konstantinos | Hussenot, Léonard | Soares, Livio Baldini | Baumli, Kate | Chang, Michael B. | Recasens, Adrià | Caine, Ben | Pritzel, Alexander | Pavetic, Filip | Pardo, Fabio | Gergely, Anita | Frye, Justin | Ramasesh, Vinay | Horgan, Dan | Badola, Kartikeya | Kassner, Nora | Roy, Subhrajit | Dyer, Ethan | Campos, Víctor Campos | Tomala, Alex | Tang, Yunhao | Badawy, Dalia El | White, Elspeth | Mustafa, Basil | Lang, Oran | Jindal, Abhishek | Vikram, Sharad | Gong, Zhitao | Caelles, Sergi | Hemsley, Ross | Thornton, Gregory | Feng, Fangxiaoyu | Stokowiec, Wojciech | Zheng, Ce | Thacker, Phoebe | Ünlü, Çağlar | Zhang, Zhishuai | Saleh, Mohammad | Svensson, James | Bileschi, Max | Patil, Piyush | Anand, Ankesh | Ring, Roman | Tsihlas, Katerina | Vezer, Arpi | Selvi, Marco | Shevlane, Toby | Rodriguez, Mikel | Kwiatkowski, Tom | Daruki, Samira | Rong, Keran | Dafoe, Allan | FitzGerald, Nicholas | Gu-Lemberg, Keren | Khan, Mina | Hendricks, Lisa Anne | Pellat, Marie | Feinberg, Vladimir | Cobon-Kerr, James | Sainath, Tara | Rauh, Maribeth | Hashemi, Sayed Hadi | Ives, Richard | Hasson, Yana | Noland, Eric | Cao, Yuan | Byrd, Nathan | Hou, Le | Wang, Qingze | Sottiaux, Thibault | Paganini, Michela | Lespiau, Jean-Baptiste | Moufarek, Alexandre | Hassan, Samer | Shivakumar, Kaushik | van Amersfoort, Joost | Mandhane, Amol | Joshi, Pratik | Goyal, Anirudh | Tung, Matthew | Brock, Andrew | Sheahan, Hannah | Misra, Vedant | Li, Cheng | Rakićević, Nemanja | Dehghani, Mostafa | Liu, Fangyu | Mittal, Sid | Oh, Junhyuk | Noury, Seb | Sezener, Eren | Huot, Fantine | Lamm, Matthew | De Cao, Nicola | Chen, Charlie | Mudgal, Sidharth | Stella, Romina | Brooks, Kevin | Vasudevan, Gautam | Liu, Chenxi | Chain, Mainak | Melinkeri, Nivedita | Cohen, Aaron | Wang, Venus | Seymore, Kristie | Zubkov, Sergey | Goel, Rahul | Yue, Summer | Krishnakumaran, Sai | Albert, Brian | Hurley, Nate | Sano, Motoki | Mohananey, Anhad | Joughin, Jonah | Filonov, Egor | Kępa, Tomasz | Eldawy, Yomna | Lim, Jiawern | Rishi, Rahul | Badiezadegan, Shirin | Bos, Taylor | Chang, Jerry | Jain, Sanil | Padmanabhan, Sri Gayatri Sundara | Puttagunta, Subha | Krishna, Kalpesh | Baker, Leslie | Kalb, Norbert | Bedapudi, Vamsi | Kurzrok, Adam | Lei, Shuntong | Yu, Anthony | Litvin, Oren | Zhou, Xiang | Wu, Zhichun | Sobell, Sam | Siciliano, Andrea | Papir, Alan | Neale, Robby | Bragagnolo, Jonas | Toor, Tej | Chen, Tina | Anklin, Valentin | Wang, Feiran | Feng, Richie | Gholami, Milad | Ling, Kevin | Liu, Lijuan | Walter, Jules | Moghaddam, Hamid | Kishore, Arun | Adamek, Jakub | Mercado, Tyler | Mallinson, Jonathan | Wandekar, Siddhinita | Cagle, Stephen | Ofek, Eran | Garrido, Guillermo | Lombriser, Clemens | Mukha, Maksim | Sun, Botu | Mohammad, Hafeezul Rahman | Matak, Josip | Qian, Yadi | Peswani, Vikas | Janus, Pawel | Yuan, Quan | Schelin, Leif | David, Oana | Garg, Ankur | He, Yifan | Duzhyi, Oleksii | Älgmyr, Anton | Lottaz, Timothée | Li, Qi | Yadav, Vikas | Xu, Luyao | Chinien, Alex | Shivanna, Rakesh | Chuklin, Aleksandr | Li, Josie | Spadine, Carrie | Wolfe, Travis | Mohamed, Kareem | Das, Subhabrata | Dai, Zihang | He, Kyle | von Dincklage, Daniel | Upadhyay, Shyam | Maurya, Akanksha | Chi, Luyan | Krause, Sebastian | Salama, Khalid | Rabinovitch, Pam G | M, Pavan Kumar Reddy | Selvan, Aarush | Dektiarev, Mikhail | Ghiasi, Golnaz | Guven, Erdem | Gupta, Himanshu | Liu, Boyi | Sharma, Deepak | Shtacher, Idan Heimlich | Paul, Shachi | Akerlund, Oscar | Aubet, François-Xavier | Huang, Terry | Zhu, Chen | Zhu, Eric | Teixeira, Elico | Fritze, Matthew | Bertolini, Francesco | Marinescu, Liana-Eleonora | Bölle, Martin | Paulus, Dominik | Gupta, Khyatti | Latkar, Tejasi | Chang, Max | Sanders, Jason | Wilson, Roopa | Wu, Xuewei | Tan, Yi-Xuan | Thiet, Lam Nguyen | Doshi, Tulsee | Lall, Sid | Mishra, Swaroop | Chen, Wanming | Luong, Thang | Benjamin, Seth | Lee, Jasmine | Andrejczuk, Ewa | Rabiej, Dominik | Ranjan, Vipul | Styrc, Krzysztof | Yin, Pengcheng | Simon, Jon | Harriott, Malcolm Rose | Bansal, Mudit | Robsky, Alexei | Bacon, Geoff | Greene, David | Mirylenka, Daniil | Zhou, Chen | Sarvana, Obaid | Goyal, Abhimanyu | Andermatt, Samuel | Siegler, Patrick | Horn, Ben | Israel, Assaf | Pongetti, Francesco | Chen, Chih-Wei "Louis" | Selvatici, Marco | Silva, Pedro | Wang, Kathie | Tolins, Jackson | Guu, Kelvin | Yogev, Roey | Cai, Xiaochen | Agostini, Alessandro | Shah, Maulik | Nguyen, Hung | Donnaile, Noah Ó | Pereira, Sébastien | Friso, Linda | Stambler, Adam | Kuang, Chenkai | Romanikhin, Yan | Geller, Mark | Yan, ZJ | Jang, Kane | Lee, Cheng-Chun | Fica, Wojciech | Malmi, Eric | Tan, Qijun | Banica, Dan | Balle, Daniel | Pham, Ryan | Huang, Yanping | Avram, Diana | Shi, Hongzhi | Singh, Jasjot | Hidey, Chris | Ahuja, Niharika | Saxena, Pranab | Dooley, Dan | Potharaju, Srividya Pranavi | O'Neill, Eileen | Gokulchandran, Anand | Foley, Ryan | Zhao, Kai | Dusenberry, Mike | Liu, Yuan | Mehta, Pulkit | Kotikalapudi, Ragha | Safranek-Shrader, Chalence | Goodman, Andrew | Kessinger, Joshua | Globen, Eran | Kolhar, Prateek | Gorgolewski, Chris | Ibrahim, Ali | Song, Yang | Eichenbaum, Ali | Brovelli, Thomas | Potluri, Sahitya | Lahoti, Preethi | Baetu, Cip | Ghorbani, Ali | Chen, Charles | Crawford, Andy | Pal, Shalini | Sridhar, Mukund | Gurita, Petru | Mujika, Asier | Petrovski, Igor | Cedoz, Pierre-Louis | Li, Chenmei | Chen, Shiyuan | Santo, Niccolò Dal | Goyal, Siddharth | Punjabi, Jitesh | Kappaganthu, Karthik | Kwak, Chester | LV, Pallavi | Velury, Sarmishta | Choudhury, Himadri | Hall, Jamie | Shah, Premal | Figueira, Ricardo | Thomas, Matt | Lu, Minjie | Zhou, Ting | Kumar, Chintu | Jurdi, Thomas | Chikkerur, Sharat | Ma, Yenai | Yu, Adams | Kwak, Soo | Ähdel, Victor | Rajayogam, Sujeevan | Choma, Travis | Liu, Fei | Barua, Aditya | Ji, Colin | Park, Ji Ho | Hellendoorn, Vincent | Bailey, Alex | Bilal, Taylan | Zhou, Huanjie | Khatir, Mehrdad | Sutton, Charles | Rzadkowski, Wojciech | Macintosh, Fiona | Shagin, Konstantin | Medina, Paul | Liang, Chen | Zhou, Jinjing | Shah, Pararth | Bi, Yingying | Dankovics, Attila | Banga, Shipra | Lehmann, Sabine | Bredesen, Marissa | Lin, Zifan | Hoffmann, John Eric | Lai, Jonathan | Chung, Raynald | Yang, Kai | Balani, Nihal | Bražinskas, Arthur | Sozanschi, Andrei | Hayes, Matthew | Alcalde, Héctor Fernández | Makarov, Peter | Chen, Will | Stella, Antonio | Snijders, Liselotte | Mandl, Michael | Kärrman, Ante | Nowak, Paweł | Wu, Xinyi | Dyck, Alex | Vaidyanathan, Krishnan | R, Raghavender | Mallet, Jessica | Rudominer, Mitch | Johnston, Eric | Mittal, Sushil | Udathu, Akhil | Christensen, Janara | Verma, Vishal | Irving, Zach | Santucci, Andreas | Elsayed, Gamaleldin | Davoodi, Elnaz | Georgiev, Marin | Tenney, Ian | Hua, Nan | Cideron, Geoffrey | Leurent, Edouard | Alnahlawi, Mahmoud | Georgescu, Ionut | Wei, Nan | Zheng, Ivy | Scandinaro, Dylan | Jiang, Heinrich | Snoek, Jasper | Sundararajan, Mukund | Wang, Xuezhi | Ontiveros, Zack | Karo, Itay | Cole, Jeremy | Rajashekhar, Vinu | Tumeh, Lara | Ben-David, Eyal | Jain, Rishub | Uesato, Jonathan | Datta, Romina | Bunyan, Oskar | Wu, Shimu | Zhang, John | Stanczyk, Piotr | Zhang, Ye | Steiner, David | Naskar, Subhajit | Azzam, Michael | Johnson, Matthew | Paszke, Adam | Chiu, Chung-Cheng | Elias, Jaume Sanchez | Mohiuddin, Afroz | Muhammad, Faizan | Miao, Jin | Lee, Andrew | Vieillard, Nino | Park, Jane | Zhang, Jiageng | Stanway, Jeff | Garmon, Drew | Karmarkar, Abhijit | Dong, Zhe | Lee, Jong | Kumar, Aviral | Zhou, Luowei | Evens, Jonathan | Isaac, William | Irving, Geoffrey | Loper, Edward | Fink, Michael | Arkatkar, Isha | Chen, Nanxin | Shafran, Izhak | Petrychenko, Ivan | Chen, Zhe | Jia, Johnson | Levskaya, Anselm | Zhu, Zhenkai | Grabowski, Peter | Mao, Yu | Magni, Alberto | Yao, Kaisheng | Snaider, Javier | Casagrande, Norman | Palmer, Evan | Suganthan, Paul | Castaño, Alfonso | Giannoumis, Irene | Kim, Wooyeol | Rybiński, Mikołaj | Sreevatsa, Ashwin | Prendki, Jennifer | Soergel, David | Goedeckemeyer, Adrian | Gierke, Willi | Jafari, Mohsen | Gaba, Meenu | Wiesner, Jeremy | Wright, Diana Gage | Wei, Yawen | Vashisht, Harsha | Kulizhskaya, Yana | Hoover, Jay | Le, Maigo | Li, Lu | Iwuanyanwu, Chimezie | Liu, Lu | Ramirez, Kevin | Khorlin, Andrey | Cui, Albert | LIN, Tian | Wu, Marcus | Aguilar, Ricardo | Pallo, Keith | Chakladar, Abhishek | Perng, Ginger | Abellan, Elena Allica | Zhang, Mingyang | Dasgupta, Ishita | Kushman, Nate | Penchev, Ivo | Repina, Alena | Wu, Xihui | van der Weide, Tom | Ponnapalli, Priya | Kaplan, Caroline | Simsa, Jiri | Li, Shuangfeng | Dousse, Olivier | Piper, Jeff | Ie, Nathan | Pasumarthi, Rama | Lintz, Nathan | Vijayakumar, Anitha | Andor, Daniel | Valenzuela, Pedro | Lui, Minnie | Paduraru, Cosmin | Peng, Daiyi | Lee, Katherine | Zhang, Shuyuan | Greene, Somer | Nguyen, Duc Dung | Kurylowicz, Paula | Hardin, Cassidy | Dixon, Lucas | Janzer, Lili | Choo, Kiam | Feng, Ziqiang | Zhang, Biao | Singhal, Achintya | Du, Dayou | McKinnon, Dan | Antropova, Natasha | Bolukbasi, Tolga | Keller, Orgad | Reid, David | Finchelstein, Daniel | Raad, Maria Abi | Crocker, Remi | Hawkins, Peter | Dadashi, Robert | Gaffney, Colin | Franko, Ken | Bulanova, Anna | Leblond, Rémi | Chung, Shirley | Askham, Harry | Cobo, Luis C. | Xu, Kelvin | Fischer, Felix | Xu, Jun | Sorokin, Christina | Alberti, Chris | Lin, Chu-Cheng | Evans, Colin | Dimitriev, Alek | Forbes, Hannah | Banarse, Dylan | Tung, Zora | Omernick, Mark | Bishop, Colton | Sterneck, Rachel | Jain, Rohan | Xia, Jiawei | Amid, Ehsan | Piccinno, Francesco | Wang, Xingyu | Banzal, Praseem | Mankowitz, Daniel J. | Polozov, Alex | Krakovna, Victoria | Brown, Sasha | Bateni, MohammadHossein | Duan, Dennis | Firoiu, Vlad | Thotakuri, Meghana | Natan, Tom | Geist, Matthieu | Girgin, Ser tan | Li, Hui | Ye, Jiayu | Roval, Ofir | Tojo, Reiko | Kwong, Michael | Lee-Thorp, James | Yew, Christopher | Sinopalnikov, Danila | Ramos, Sabela | Mellor, John | Sharma, Abhishek | Wu, Kathy | Miller, David | Sonnerat, Nicolas | Vnukov, Denis | Greig, Rory | Beattie, Jennifer | Caveness, Emily | Bai, Libin | Eisenschlos, Julian | Korchemniy, Alex | Tsai, Tomy | Jasarevic, Mimi | Kong, Weize | Dao, Phuong | Zheng, Zeyu | Liu, Frederick | Zhu, Rui | Teh, Tian Huey | Sanmiya, Jason | Gladchenko, Evgeny | Trdin, Nejc | Toyama, Daniel | Rosen, Evan | Tavakkol, Sasan | Xue, Linting | Elkind, Chen | Woodman, Oliver | Carpenter, John | Papamakarios, George | Kemp, Rupert | Kafle, Sushant | Grunina, Tanya | Sinha, Rishika | Talbert, Alice | Wu, Diane | Owusu-Afriyie, Denese | Thornton, Chloe | Pont-Tuset, Jordi | Narayana, Pradyumna | Li, Jing | Fatehi, Saaber | Wieting, John | Ajmeri, Omar | Uria, Benigno | Ko, Yeongil | Knight, Laura | Héliou, Amélie | Niu, Ning | Gu, Shane | Pang, Chenxi | Li, Yeqing | Levine, Nir | Stolovich, Ariel | Santamaria-Fernandez, Rebeca | Goenka, Sonam | Yustalim, Wenny | Strudel, Robin | Elqursh, Ali | Deck, Charlie | Lee, Hyo | Li, Zonglin | Levin, Kyle | Hoffmann, Raphael | Holtmann-Rice, Dan | Bachem, Olivier | Arora, Sho | Koh, Christy | Yeganeh, Soheil Hassas | Põder, Siim | Tariq, Mukarram | Sun, Yanhua | Ionita, Lucian | Seyedhosseini, Mojtaba | Tafti, Pouya | Liu, Zhiyu | Gulati, Anmol | Liu, Jasmine | Ye, Xinyu | Chrzaszcz, Bart | Wang, Lily | Sethi, Nikhil | Li, Tianrun | Brown, Ben | Singh, Shreya | Fan, Wei | Parisi, Aaron | Stanton, Joe | Koverkathu, Vinod | Choquette-Choo, Christopher A. | Li, Yunjie | Lu, TJ | Shroff, Prakash | Varadarajan, Mani | Bahargam, Sanaz | Willoughby, Rob | Gaddy, David | Desjardins, Guillaume | Cornero, Marco | Robenek, Brona | Mittal, Bhavishya | Albrecht, Ben | Shenoy, Ashish | Moiseev, Fedor | Jacobsson, Henrik | Ghaffarkhah, Alireza | Rivière, Morgane | Walton, Alanna | Crepy, Clément | Parrish, Alicia | Zhou, Zongwei | Farabet, Clement | Radebaugh, Carey | Srinivasan, Praveen | van der Salm, Claudia | Fidjeland, Andreas | Scellato, Salvatore | Latorre-Chimoto, Eri | Klimczak-Plucińska, Hanna | Bridson, David | de Cesare, Dario | Hudson, Tom | Mendolicchio, Piermaria | Walker, Lexi | Morris, Alex | Mauger, Matthew | Guseynov, Alexey | Reid, Alison | Odoom, Seth | Loher, Lucia | Cotruta, Victor | Yenugula, Madhavi | Grewe, Dominik | Petrushkina, Anastasia | Duerig, Tom | Sanchez, Antonio | Yadlowsky, Steve | Shen, Amy | Globerson, Amir | Webb, Lynette | Dua, Sahil | Li, Dong | Bhupatiraju, Surya | Hurt, Dan | Qureshi, Haroon | Agarwal, Ananth | Shani, Tomer | Eyal, Matan | Khare, Anuj | Belle, Shreyas Rammohan | Wang, Lei | Tekur, Chetan | Kale, Mihir Sanjay | Wei, Jinliang | Sang, Ruoxin | Saeta, Brennan | Liechty, Tyler | Sun, Yi | Zhao, Yao | Lee, Stephan | Nayak, Pandu | Fritz, Doug | Vuyyuru, Manish Reddy | Aslanides, John | Vyas, Nidhi | Wicke, Martin | Ma, Xiao | Eltyshev, Evgenii | Martin, Nina | Cate, Hardie | Manyika, James | Amiri, Keyvan | Kim, Yelin | Xiong, Xi | Kang, Kai | Luisier, Florian | Tripuraneni, Nilesh | Madras, David | Guo, Mandy | Waters, Austin | Wang, Oliver | Ainslie, Joshua | Baldridge, Jason | Zhang, Han | Pruthi, Garima | Bauer, Jakob | Yang, Feng | Mansour, Riham | Gelman, Jason | Xu, Yang | Polovets, George | Liu, Ji | Cai, Honglong | Chen, Warren | Sheng, XiangHai | Xue, Emily | Ozair, Sherjil | Angermueller, Christof | Li, Xiaowei | Sinha, Anoop | Wang, Weiren | Wiesinger, Julia | Koukoumidis, Emmanouil | Tian, Yuan | Iyer, Anand | Gurumurthy, Madhu | Goldenson, Mark | Shah, Parashar | Blake, MK | Yu, Hongkun | Urbanowicz, Anthony | Palomaki, Jennimaria | Fernando, Chrisantha | Durden, Ken | Mehta, Harsh | Momchev, Nikola | Rahimtoroghi, Elahe | Georgaki, Maria | Raul, Amit | Ruder, Sebastian | Redshaw, Morgan | Lee, Jinhyuk | Zhou, Denny | Jalan, Komal | Li, Dinghua | Hechtman, Blake | Schuh, Parker | Nasr, Milad | Milan, Kieran | Mikulik, Vladimir | Franco, Juliana | Green, Tim | Nguyen, Nam | Kelley, Joe | Mahendru, Aroma | Hu, Andrea | Howland, Joshua | Vargas, Ben | Hui, Jeffrey | Bansal, Kshitij | Rao, Vikram | Ghiya, Rakesh | Wang, Emma | Ye, Ke | Sarr, Jean Michel | Preston, Melanie Moranski | Elish, Madeleine | Li, Steve | Kaku, Aakash | Gupta, Jigar | Pasupat, Ice | Juan, Da-Cheng | Someswar, Milan | M., Tejvi | Chen, Xinyun | Amini, Aida | Fabrikant, Alex | Chu, Eric | Dong, Xuanyi | Muthal, Amruta | Buthpitiya, Senaka | Jauhari, Sarthak | Khandelwal, Urvashi | Hitron, Ayal | Ren, Jie | Rinaldi, Larissa | Drath, Shahar | Dabush, Avigail | Jiang, Nan-Jiang | Godhia, Harshal | Sachs, Uli | Chen, Anthony | Fan, Yicheng | Taitelbaum, Hagai | Noga, Hila | Dai, Zhuyun | Wang, James | Hamer, Jenny | Ferng, Chun-Sung | Elkind, Chenel | Atias, Aviel | Lee, Paulina | Listík, Vít | Carlen, Mathias | van de Kerkhof, Jan | Pikus, Marcin | Zaher, Krunoslav | Müller, Paul | Zykova, Sasha | Stefanec, Richard | Gatsko, Vitaly | Hirnschall, Christoph | Sethi, Ashwin | Xu, Xingyu Federico | Ahuja, Chetan | Tsai, Beth | Stefanoiu, Anca | Feng, Bo | Dhandhania, Keshav | Katyal, Manish | Gupta, Akshay | Parulekar, Atharva | Pitta, Divya | Zhao, Jing | Bhatia, Vivaan | Bhavnani, Yashodha | Alhadlaq, Omar | Li, Xiaolin | Danenberg, Peter | Tu, Dennis | Pine, Alex | Filippova, Vera | Ghosh, Abhipso | Limonchik, Ben | Urala, Bhargava | Lanka, Chaitanya Krishna | Clive, Derik | Li, Edward | Wu, Hao | Hongtongsak, Kevin | Li, Ianna | Thakkar, Kalind | Omarov, Kuanysh | Majmundar, Kushal | Alverson, Michael | Kucharski, Michael | Patel, Mohak | Jain, Mudit | Zabelin, Maksim | Pelagatti, Paolo | Kohli, Rohan | Kumar, Saurabh | Kim, Joseph | Sankar, Swetha | Shah, Vineet | Ramachandruni, Lakshmi | Zeng, Xiangkai | Bariach, Ben | Weidinger, Laura | Vu, Tu | Andreev, Alek | He, Antoine | Hui, Kevin | Kashem, Sheleem | Subramanya, Amar | Hsiao, Sissie | Hassabis, Demis | Kavukcuoglu, Koray | Sadovsky, Adam | Le, Quoc | Strohman, Trevor | Wu, Yonghui | Petrov, Slav | Dean, Jeffrey | Vinyals, Oriol

Dec 18, 2023 – The report introduces the Gemini family of multimodal models, including Ultra, Pro, and Nano sizes, which excel in image, audio, video, and text understanding. The most advanced model, Gemini Ultra, outperforms existing benchmarks in 30 out of 32 tasks, achieving human-expert performance in some cases and improving state-of-the-art results in all examined multimodal benchmarks.

The Llama 3 Herd of Models

PageRank: 4,131
Growth: +3,358%
Citations: 1,238

Dubey, Abhimanyu | Jauhri, Abhinav | Pandey, Abhinav | Kadian, Abhishek | Al-Dahle, Ahmad | Letman, Aiesha | Mathur, Akhil | Schelten, Alan | Yang, Amy | Fan, Angela | Goyal, Anirudh | Hartshorn, Anthony | Yang, Aobo | Mitra, Archi | Sravankumar, Archie | Korenev, Artem | Hinsvark, Arthur | Rao, Arun | Zhang, Aston | Rodriguez, Aurelien | Gregerson, Austen | Spataru, Ava | Roziere, Baptiste | Biron, Bethany | Tang, Binh | Chern, Bobbie | Caucheteux, Charlotte | Nayak, Chaya | Bi, Chloe | Marra, Chris | McConnell, Chris | Keller, Christian | Touret, Christophe | Wu, Chunyang | Wong, Corinne | Ferrer, Cristian Canton | Nikolaidis, Cyrus | Allonsius, Damien | Song, Daniel | Pintz, Danielle | Livshits, Danny | Esiobu, David | Choudhary, Dhruv | Mahajan, Dhruv | Garcia-Olano, Diego | Perino, Diego | Hupkes, Dieuwke | Lakomkin, Egor | AlBadawy, Ehab | Lobanova, Elina | Dinan, Emily | Smith, Eric Michael | Radenovic, Filip | Zhang, Frank | Synnaeve, Gabriel | Lee, Gabrielle | Anderson, Georgia Lewis | Nail, Graeme | Mialon, Gregoire | Pang, Guan | Cucurell, Guillem | Nguyen, Hailey | Korevaar, Hannah | Xu, Hu | Touvron, Hugo | Zarov, Iliyan | Ibarra, Imanol Arrieta | Kloumann, Isabel | Misra, Ishan | Evtimov, Ivan | Copet, Jade | Lee, Jaewon | Geffert, Jan | Vranes, Jana | Park, Jason | Mahadeokar, Jay | Shah, Jeet | van der Linde, Jelmer | Billock, Jennifer | Hong, Jenny | Lee, Jenya | Fu, Jeremy | Chi, Jianfeng | Huang, Jianyu | Liu, Jiawen | Wang, Jie | Yu, Jiecao | Bitton, Joanna | Spisak, Joe | Park, Jongsoo | Rocca, Joseph | Johnstun, Joshua | Saxe, Joshua | Jia, Junteng | Alwala, Kalyan Vasuden | Upasani, Kartikeya | Plawiak, Kate | Li, Ke | Heafield, Kenneth | Stone, Kevin | El-Arini, Khalid | Iyer, Krithika | Malik, Kshitiz | Chiu, Kuenley | Bhalla, Kunal | Rantala-Yeary, Lauren | van der Maaten, Laurens | Chen, Lawrence | Tan, Liang | Jenkins, Liz | Martin, Louis | Madaan, Lovish | Malo, Lubo | Blecher, Lukas | Landzaat, Lukas | de Oliveira, Luke | Muzzi, Madeline | Pasupuleti, Mahesh | Singh, Mannat | Paluri, Manohar | Kardas, Marcin | Oldham, Mathew | Rita, Mathieu | Pavlova, Maya | Kambadur, Melanie | Lewis, Mike | Si, Min | Singh, Mitesh Kumar | Hassan, Mona | Goyal, Naman | Torabi, Narjes | Bashlykov, Nikolay | Bogoychev, Nikolay | Chatterji, Niladri | Duchenne, Olivier | Çelebi, Onur | Alrassy, Patrick | Zhang, Pengchuan | Li, Pengwei | Vasic, Petar | Weng, Peter | Bhargava, Prajjwal | Dubal, Pratik | Krishnan, Praveen | Koura, Punit Singh | Xu, Puxin | He, Qing | Dong, Qingxiao | Srinivasan, Ragavan | Ganapathy, Raj | Calderer, Ramon | Cabral, Ricardo Silveira | Stojnic, Robert | Raileanu, Roberta | Girdhar, Rohit | Patel, Rohit | Sauvestre, Romain | Polidoro, Ronnie | Sumbaly, Roshan | Taylor, Ross | Silva, Ruan | Hou, Rui | Wang, Rui | Hosseini, Saghar | Chennabasappa, Sahana | Singh, Sanjay | Bell, Sean | Kim, Seohyun Sonia | Edunov, Sergey | Nie, Shaoliang | Narang, Sharan | Raparthy, Sharath | Shen, Sheng | Wan, Shengye | Bhosale, Shruti | Zhang, Shun | Vandenhende, Simon | Batra, Soumya | Whitman, Spencer | Sootla, Sten | Collot, Stephane | Gururangan, Suchin | Borodinsky, Sydney | Herman, Tamar | Fowler, Tara | Sheasha, Tarek | Georgiou, Thomas | Scialom, Thomas | Speckbacher, Tobias | Mihaylov, Todor | Xiao, Tong | Karn, Ujjwal | Goswami, Vedanuj | Gupta, Vibhor | Ramanathan, Vignesh | Kerkez, Viktor | Gonguet, Vincent | Do, Virginie | Vogeti, Vish | Petrovic, Vladan | Chu, Weiwei | Xiong, Wenhan | Fu, Wenyin | Meers, Whitney | Martinet, Xavier | Wang, Xiaodong | Tan, Xiaoqing Ellen | Xie, Xinfeng | Jia, Xuchao | Wang, Xuewei | Goldschlag, Yaelle | Gaur, Yashesh | Babaei, Yasmine | Wen, Yi | Song, Yiwen | Zhang, Yuchen | Li, Yue | Mao, Yuning | Coudert, Zacharie Delpierre | Yan, Zheng | Chen, Zhengxing | Papakipos, Zoe | Singh, Aaditya | Grattafiori, Aaron | Jain, Abha | Kelsey, Adam | Shajnfeld, Adam | Gangidi, Adithya | Victoria, Adolfo | Goldstand, Ahuva | Menon, Ajay | Sharma, Ajay | Boesenberg, Alex | Vaughan, Alex | Baevski, Alexei | Feinstein, Allie | Kallet, Amanda | Sangani, Amit | Yunus, Anam | Lupu, Andrei | Alvarado, Andres | Caples, Andrew | Gu, Andrew | Ho, Andrew | Poulton, Andrew | Ryan, Andrew | Ramchandani, Ankit | Franco, Annie | Saraf, Aparajita | Chowdhury, Arkabandhu | Gabriel, Ashley | Bharambe, Ashwin | Eisenman, Assaf | Yazdan, Azadeh | James, Beau | Maurer, Ben | Leonhardi, Benjamin | Huang, Bernie | Loyd, Beth | De Paola, Beto | Paranjape, Bhargavi | Liu, Bing | Wu, Bo | Ni, Boyu | Hancock, Braden | Wasti, Bram | Spence, Brandon | Stojkovic, Brani | Gamido, Brian | Montalvo, Britt | Parker, Carl | Burton, Carly | Mejia, Catalina | Wang, Changhan | Kim, Changkyu | Zhou, Chao | Hu, Chester | Chu, Ching-Hsiang | Cai, Chris | Tindal, Chris | Feichtenhofer, Christoph | Civin, Damon | Beaty, Dana | Kreymer, Daniel | Li, Daniel | Wyatt, Danny | Adkins, David | Xu, David | Testuggine, Davide | David, Delia | Parikh, Devi | Liskovich, Diana | Foss, Didem | Wang, Dingkang | Le, Duc | Holland, Dustin | Dowling, Edward | Jamil, Eissa | Montgomery, Elaine | Presani, Eleonora | Hahn, Emily | Wood, Emily | Brinkman, Erik | Arcaute, Esteban | Dunbar, Evan | Smothers, Evan | Sun, Fei | Kreuk, Felix | Tian, Feng | Ozgenel, Firat | Caggioni, Francesco | Guzmán, Francisco | Kanayet, Frank | Seide, Frank | Florez, Gabriela Medina | Schwarz, Gabriella | Badeer, Gada | Swee, Georgia | Halpern, Gil | Thattai, Govind | Herman, Grant | Sizov, Grigory | Guangyi | Zhang | Lakshminarayanan, Guna | Shojanazeri, Hamid | Zou, Han | Wang, Hannah | Zha, Hanwen | Habeeb, Haroun | Rudolph, Harrison | Suk, Helen | Aspegren, Henry | Goldman, Hunter | Damlaj, Ibrahim | Molybog, Igor | Tufanov, Igor | Veliche, Irina-Elena | Gat, Itai | Weissman, Jake | Geboski, James | Kohli, James | Asher, Japhet | Gaya, Jean-Baptiste | Marcus, Jeff | Tang, Jeff | Chan, Jennifer | Zhen, Jenny | Reizenstein, Jeremy | Teboul, Jeremy | Zhong, Jessica | Jin, Jian | Yang, Jingyi | Cummings, Joe | Carvill, Jon | Shepard, Jon | McPhie, Jonathan | Torres, Jonathan | Ginsburg, Josh | Wang, Junjie | Wu, Kai | U, Kam Hou | Saxena, Karan | Prasad, Karthik | Khandelwal, Kartikay | Zand, Katayoun | Matosich, Kathy | Veeraraghavan, Kaushik | Michelena, Kelly | Li, Keqian | Huang, Kun | Chawla, Kunal | Lakhotia, Kushal | Huang, Kyle | Chen, Lailin | Garg, Lakshya | A, Lavender | Silva, Leandro | Bell, Lee | Zhang, Lei | Guo, Liangpeng | Yu, Licheng | Moshkovich, Liron | Wehrstedt, Luca | Khabsa, Madian | Avalani, Manav | Bhatt, Manish | Tsimpoukelli, Maria | Mankus, Martynas | Hasson, Matan | Lennie, Matthew | Reso, Matthias | Groshev, Maxim | Naumov, Maxim | Lathi, Maya | Keneally, Meghan | Seltzer, Michael L. | Valko, Michal | Restrepo, Michelle | Patel, Mihir | Vyatskov, Mik | Samvelyan, Mikayel | Clark, Mike | Macey, Mike | Wang, Mike | Hermoso, Miquel Jubert | Metanat, Mo | Rastegari, Mohammad | Bansal, Munish | Santhanam, Nandhini | Parks, Natascha | White, Natasha | Bawa, Navyata | Singhal, Nayan | Egebo, Nick | Usunier, Nicolas | Laptev, Nikolay Pavlovich | Dong, Ning | Zhang, Ning | Cheng, Norman | Chernoguz, Oleg | Hart, Olivia | Salpekar, Omkar | Kalinli, Ozlem | Kent, Parkin | Parekh, Parth | Saab, Paul | Balaji, Pavan | Rittner, Pedro | Bontrager, Philip | Roux, Pierre | Dollar, Piotr | Zvyagina, Polina | Ratanchandani, Prashant | Yuvraj, Pritish | Liang, Qian | Alao, Rachad | Rodriguez, Rachel | Ayub, Rafi | Murthy, Raghotham | Nayani, Raghu | Mitra, Rahul | Li, Raymond | Hogan, Rebekkah | Battey, Robin | Wang, Rocky | Maheswari, Rohan | Howes, Russ | Rinott, Ruty | Bondu, Sai Jayesh | Datta, Samyak | Chugh, Sara | Hunt, Sara | Dhillon, Sargun | Sidorov, Sasha | Pan, Satadru | Verma, Saurabh | Yamamoto, Seiji | Ramaswamy, Sharadh | Lindsay, Shaun | Feng, Sheng | Lin, Shenghao | Zha, Shengxin Cindy | Shankar, Shiva | Zhang, Shuqiang | Wang, Sinong | Agarwal, Sneha | Sajuyigbe, Soji | Chintala, Soumith | Max, Stephanie | Chen, Stephen | Kehoe, Steve | Satterfield, Steve | Govindaprasad, Sudarshan | Gupta, Sumit | Cho, Sungmin | Virk, Sunny | Subramanian, Suraj | Choudhury, Sy | Goldman, Sydney | Remez, Tal | Glaser, Tamar | Best, Tamara | Kohler, Thilo | Robinson, Thomas | Li, Tianhe | Zhang, Tianjun | Matthews, Tim | Chou, Timothy | Shaked, Tzook | Vontimitta, Varun | Ajayi, Victoria | Montanez, Victoria | Mohan, Vijai | Kumar, Vinay Satish | Mangla, Vishal | Albiero, Vítor | Ionescu, Vlad | Poenaru, Vlad | Mihailescu, Vlad Tiberiu | Ivanov, Vladimir | Li, Wei | Wang, Wenchen | Jiang, Wenwen | Bouaziz, Wes | Constable, Will | Tang, Xiaocheng | Wang, Xiaofang | Wu, Xiaojian | Wang, Xiaolan | Xia, Xide | Wu, Xilun | Gao, Xinbo | Chen, Yanjun | Hu, Ye | Jia, Ye | Qi, Ye | Li, Yenda | Zhang, Yilin | Zhang, Ying | Adi, Yossi | Nam, Youngjin | Yu | Wang | Hao, Yuchen | Qian, Yundi | He, Yuzi | Rait, Zach | DeVito, Zachary | Rosnbrick, Zef | Wen, Zhaoduo | Yang, Zhenyu | Zhao, Zhiwei

Jul 31, 2024 – The paper introduces Llama 3, a new set of multilingual foundation models with advanced capabilities such as coding, reasoning, and tool usage. Llama 3, including a 405B parameter language model and safety features, shows comparable performance to leading language models like GPT-4 across various tasks, and experimental integration of image, video, and speech capabilities demonstrates competitive results, although the models are still in development and not widely available.

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

PageRank: 4,061
Growth: +2,436%
Citations: 1,124

Gu, Albert | Dao, Tri

Dec 1, 2023 – The paper introduces Mamba, a new sequence modeling architecture that addresses the computational inefficiency of Transformers on long sequences. By incorporating selective state spaces and making improvements to content-based reasoning, Mamba achieves fast inference, linear scaling in sequence length, and outperforms Transformers in various modalities such as language, audio, and genomics.

Improved Baselines with Visual Instruction Tuning

PageRank: 4,211
Growth: +2,237%
Citations: 1,424

Liu, Haotian | Li, Chunyuan | Li, Yuheng | Lee, Yong Jae

Oct 5, 2023 – The authors of this note demonstrate that making simple modifications to the LLaVA model, such as using CLIP-ViT-L-336px with an MLP projection and incorporating academic-task-oriented VQA data, can lead to stronger baselines and achieve state-of-the-art results on 11 benchmarks. Their final 13B checkpoint only requires 1.2M publicly available data and can be trained in approximately one day on a single 8-A100 node, making it more accessible for LMM research.

Qwen Technical Report

PageRank: 3,999
Growth: +2,135%
Citations: 1,272

Bai, Jinze | Bai, Shuai | Chu, Yunfei | Cui, Zeyu | Dang, Kai | Deng, Xiaodong | Fan, Yang | Ge, Wenbin | Han, Yu | Huang, Fei | Hui, Binyuan | Ji, Luo | Li, Mei | Lin, Junyang | Lin, Runji | Liu, Dayiheng | Liu, Gao | Lu, Chengqiang | Lu, Keming | Ma, Jianxin | Men, Rui | Ren, Xingzhang | Ren, Xuancheng | Tan, Chuanqi | Tan, Sinan | Tu, Jianhong | Wang, Peng | Wang, Shijie | Wang, Wei | Wu, Shengguang | Xu, Benfeng | Xu, Jin | Yang, An | Yang, Hao | Yang, Jian | Yang, Shusheng | Yao, Yang | Yu, Bowen | Yuan, Hongyi | Yuan, Zheng | Zhang, Jianwei | Zhang, Xingxuan | Zhang, Yichang | Zhang, Zhenru | Zhou, Chang | Zhou, Jingren | Zhou, Xiaohuan | Zhu, Tianhang

Sep 28, 2023 – The Qwen language model series, including base models and chat models, has been introduced as a comprehensive and high-performing solution for natural language processing tasks. The models demonstrate superior performance in various downstream tasks, with the chat models showcasing advanced tool-use and planning capabilities, particularly in coding and mathematics-focused applications.

3D Gaussian Splatting for Real-Time Radiance Field Rendering

PageRank: 3,460
Growth: +2,013%
Citations: 1,496

Kerbl, Bernhard | Kopanas, Georgios | Leimkühler, Thomas | Drettakis, George

Aug 8, 2023 – This paper introduces a method for real-time rendering of radiance fields, which achieves high visual quality while maintaining competitive training times. The method utilizes 3D Gaussians to represent the scene, performs optimization of the Gaussians, and incorporates a fast visibility-aware rendering algorithm.

Mixtral of Experts

PageRank: 3,220
Growth: +1,712%
Citations: 1,358

Jiang, Albert Q. | Sablayrolles, Alexandre | Roux, Antoine | Mensch, Arthur | Savary, Blanche | Bamford, Chris | Chaplot, Devendra Singh | Casas, Diego de las | Hanna, Emma Bou | Bressand, Florian | Lengyel, Gianna | Bour, Guillaume | Lample, Guillaume | Lavaud, Lélio Renard | Saulnier, Lucile | Lachaux, Marie-Anne | Stock, Pierre | Subramanian, Sandeep | Yang, Sophia | Antoniak, Szymon | Scao, Teven Le | Gervet, Théophile | Lavril, Thibaut | Wang, Thomas | Lacroix, Timothée | Sayed, William El

Jan 8, 2024 – Mixtral 8x7B is a Sparse Mixture of Experts (SMoE) language model that outperforms other models in various benchmarks, including mathematics, code generation, and multilingual tasks. It has 47B parameters but only uses 13B active parameters during inference.

Efficient Memory Management for Large Language Model Serving with PagedAttention

PageRank: 4,891
Growth: +1,615%
Citations: 814

Kwon, Woosuk | Li, Zhuohan | Zhuang, Siyuan | Sheng, Ying | Zheng, Lianmin | Yu, Cody Hao | Gonzalez, Joseph E. | Zhang, Hao | Stoica, Ion

Sep 12, 2023 – The paper proposes PagedAttention, an attention algorithm inspired by virtual memory and paging techniques, to efficiently manage memory for large language model serving. The proposed system, vLLM, achieves near-zero waste in memory and flexible sharing of memory within and across requests, resulting in improved throughput compared to existing systems.

Gemma: Open Models Based on Gemini Research and Technology

PageRank: 6,495
Growth: +1,368%
Citations: 806

Gemma Team | Mesnard, Thomas | Hardin, Cassidy | Dadashi, Robert | Bhupatiraju, Surya | Pathak, Shreya | Sifre, Laurent | Rivière, Morgane | Kale, Mihir Sanjay | Love, Juliette | Tafti, Pouya | Hussenot, Léonard | Sessa, Pier Giuseppe | Chowdhery, Aakanksha | Roberts, Adam | Barua, Aditya | Botev, Alex | Castro-Ros, Alex | Slone, Ambrose | Héliou, Amélie | Tacchetti, Andrea | Bulanova, Anna | Paterson, Antonia | Tsai, Beth | Shahriari, Bobak | Lan, Charline Le | Choquette-Choo, Christopher A. | Crepy, Clément | Cer, Daniel | Ippolito, Daphne | Reid, David | Buchatskaya, Elena | Ni, Eric | Noland, Eric | Yan, Geng | Tucker, George | Muraru, George-Christian | Rozhdestvenskiy, Grigory | Michalewski, Henryk | Tenney, Ian | Grishchenko, Ivan | Austin, Jacob | Keeling, James | Labanowski, Jane | Lespiau, Jean-Baptiste | Stanway, Jeff | Brennan, Jenny | Chen, Jeremy | Ferret, Johan | Chiu, Justin | Mao-Jones, Justin | Lee, Katherine | Yu, Kathy | Millican, Katie | Sjoesund, Lars Lowe | Lee, Lisa | Dixon, Lucas | Reid, Machel | Mikuła, Maciej | Wirth, Mateo | Sharman, Michael | Chinaev, Nikolai | Thain, Nithum | Bachem, Olivier | Chang, Oscar | Wahltinez, Oscar | Bailey, Paige | Michel, Paul | Yotov, Petko | Chaabouni, Rahma | Comanescu, Ramona | Jana, Reena | Anil, Rohan | McIlroy, Ross | Liu, Ruibo | Mullins, Ryan | Smith, Samuel L | Borgeaud, Sebastian | Girgin, Sertan | Douglas, Sholto | Pandya, Shree | Shakeri, Siamak | De, Soham | Klimenko, Ted | Hennigan, Tom | Feinberg, Vlad | Stokowiec, Wojciech | Chen, Yu-hui | Ahmed, Zafarali | Gong, Zhitao | Warkentin, Tris | Peran, Ludovic | Giang, Minh | Farabet, Clément | Vinyals, Oriol | Dean, Jeff | Kavukcuoglu, Koray | Hassabis, Demis | Ghahramani, Zoubin | Eck, Douglas | Barral, Joelle | Pereira, Fernando | Collins, Eli | Joulin, Armand | Fiedel, Noah | Senter, Evan | Andreev, Alek | Kenealy, Kathleen

Mar 13, 2024 – Gemma is a new family of open models based on Gemini research and technology, offering strong performance across various language understanding tasks. Available in two sizes with pretrained and fine-tuned checkpoints, Gemma outperforms other models on text-based tasks and emphasizes safety and responsibility in model development.

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

PageRank: 8,812
Growth: +1,366%
Citations: 641

Gemini Team | Georgiev, Petko | Lei, Ving Ian | Burnell, Ryan | Bai, Libin | Gulati, Anmol | Tanzer, Garrett | Vincent, Damien | Pan, Zhufeng | Wang, Shibo | Mariooryad, Soroosh | Ding, Yifan | Geng, Xinyang | Alcober, Fred | Frostig, Roy | Omernick, Mark | Walker, Lexi | Paduraru, Cosmin | Sorokin, Christina | Tacchetti, Andrea | Gaffney, Colin | Daruki, Samira | Sercinoglu, Olcan | Gleicher, Zach | Love, Juliette | Voigtlaender, Paul | Jain, Rohan | Surita, Gabriela | Mohamed, Kareem | Blevins, Rory | Ahn, Junwhan | Zhu, Tao | Kawintiranon, Kornraphop | Firat, Orhan | Gu, Yiming | Zhang, Yujing | Rahtz, Matthew | Faruqui, Manaal | Clay, Natalie | Gilmer, Justin | Co-Reyes, JD | Penchev, Ivo | Zhu, Rui | Morioka, Nobuyuki | Hui, Kevin | Haridasan, Krishna | Campos, Victor | Mahdieh, Mahdis | Guo, Mandy | Hassan, Samer | Kilgour, Kevin | Vezer, Arpi | Cheng, Heng-Tze | de Liedekerke, Raoul | Goyal, Siddharth | Barham, Paul | Strouse, DJ | Noury, Seb | Adler, Jonas | Sundararajan, Mukund | Vikram, Sharad | Lepikhin, Dmitry | Paganini, Michela | Garcia, Xavier | Yang, Fan | Valter, Dasha | Trebacz, Maja | Vodrahalli, Kiran | Asawaroengchai, Chulayuth | Ring, Roman | Kalb, Norbert | Soares, Livio Baldini | Brahma, Siddhartha | Steiner, David | Yu, Tianhe | Mentzer, Fabian | He, Antoine | Gonzalez, Lucas | Xu, Bibo | Kaufman, Raphael Lopez | Shafey, Laurent El | Oh, Junhyuk | Hennigan, Tom | Driessche, George van den | Odoom, Seth | Lucic, Mario | Roelofs, Becca | Lall, Sid | Marathe, Amit | Chan, Betty | Ontanon, Santiago | He, Luheng | Teplyashin, Denis | Lai, Jonathan | Crone, Phil | Damoc, Bogdan | Ho, Lewis | Riedel, Sebastian | Lenc, Karel | Yeh, Chih-Kuan | Chowdhery, Aakanksha | Xu, Yang | Kazemi, Mehran | Amid, Ehsan | Petrushkina, Anastasia | Swersky, Kevin | Khodaei, Ali | Chen, Gowoon | Larkin, Chris | Pinto, Mario | Yan, Geng | Badia, Adria Puigdomenech | Patil, Piyush | Hansen, Steven | Orr, Dave | Arnold, Sebastien M. R. | Grimstad, Jordan | Dai, Andrew | Douglas, Sholto | Sinha, Rishika | Yadav, Vikas | Chen, Xi | Gribovskaya, Elena | Austin, Jacob | Zhao, Jeffrey | Patel, Kaushal | Komarek, Paul | Austin, Sophia | Borgeaud, Sebastian | Friso, Linda | Goyal, Abhimanyu | Caine, Ben | Cao, Kris | Chung, Da-Woon | Lamm, Matthew | Barth-Maron, Gabe | Kagohara, Thais | Olszewska, Kate | Chen, Mia | Shivakumar, Kaushik | Agarwal, Rishabh | Godhia, Harshal | Rajwar, Ravi | Snaider, Javier | Dotiwalla, Xerxes | Liu, Yuan | Barua, Aditya | Ungureanu, Victor | Zhang, Yuan | Batsaikhan, Bat-Orgil | Wirth, Mateo | Qin, James | Danihelka, Ivo | Doshi, Tulsee | Chadwick, Martin | Chen, Jilin | Jain, Sanil | Le, Quoc | Kar, Arjun | Gurumurthy, Madhu | Li, Cheng | Sang, Ruoxin | Liu, Fangyu | Lamprou, Lampros | Munoz, Rich | Lintz, Nathan | Mehta, Harsh | Howard, Heidi | Reynolds, Malcolm | Aroyo, Lora | Wang, Quan | Blanco, Lorenzo | Cassirer, Albin | Griffith, Jordan | Das, Dipanjan | Lee, Stephan | Sygnowski, Jakub | Fisher, Zach | Besley, James | Powell, Richard | Ahmed, Zafarali | Paulus, Dominik | Reitter, David | Borsos, Zalan | Joshi, Rishabh | Pope, Aedan | Hand, Steven | Selo, Vittorio | Jain, Vihan | Sethi, Nikhil | Goel, Megha | Makino, Takaki | May, Rhys | Yang, Zhen | Schalkwyk, Johan | Butterfield, Christina | Hauth, Anja | Goldin, Alex | Hawkins, Will | Senter, Evan | Brin, Sergey | Woodman, Oliver | Ritter, Marvin | Noland, Eric | Giang, Minh | Bolina, Vijay | Lee, Lisa | Blyth, Tim | Mackinnon, Ian | Reid, Machel | Sarvana, Obaid | Silver, David | Chen, Alexander | Wang, Lily | Maggiore, Loren | Chang, Oscar | Attaluri, Nithya | Thornton, Gregory | Chiu, Chung-Cheng | Bunyan, Oskar | Levine, Nir | Chung, Timothy | Eltyshev, Evgenii | Si, Xiance | Lillicrap, Timothy | Brady, Demetra | Aggarwal, Vaibhav | Wu, Boxi | Xu, Yuanzhong | McIlroy, Ross | Badola, Kartikeya | Sandhu, Paramjit | Moreira, Erica | Stokowiec, Wojciech | Hemsley, Ross | Li, Dong | Tudor, Alex | Shyam, Pranav | Rahimtoroghi, Elahe | Haykal, Salem | Sprechmann, Pablo | Zhou, Xiang | Mincu, Diana | Li, Yujia | Addanki, Ravi | Krishna, Kalpesh | Wu, Xiao | Frechette, Alexandre | Eyal, Matan | Dafoe, Allan | Lacey, Dave | Whang, Jay | Avrahami, Thi | Zhang, Ye | Taropa, Emanuel | Lin, Hanzhao | Toyama, Daniel | Rutherford, Eliza | Sano, Motoki | Choe, HyunJeong | Tomala, Alex | Safranek-Shrader, Chalence | Kassner, Nora | Pajarskas, Mantas | Harvey, Matt | Sechrist, Sean | Fortunato, Meire | Lyu, Christina | Elsayed, Gamaleldin | Kuang, Chenkai | Lottes, James | Chu, Eric | Jia, Chao | Chen, Chih-Wei | Humphreys, Peter | Baumli, Kate | Tao, Connie | Samuel, Rajkumar | Santos, Cicero Nogueira dos | Andreassen, Anders | Rakićević, Nemanja | Grewe, Dominik | Kumar, Aviral | Winkler, Stephanie | Caton, Jonathan | Brock, Andrew | Dalmia, Sid | Sheahan, Hannah | Barr, Iain | Miao, Yingjie | Natsev, Paul | Devlin, Jacob | Behbahani, Feryal | Prost, Flavien | Sun, Yanhua | Myaskovsky, Artiom | Pillai, Thanumalayan Sankaranarayana | Hurt, Dan | Lazaridou, Angeliki | Xiong, Xi | Zheng, Ce | Pardo, Fabio | Li, Xiaowei | Horgan, Dan | Stanton, Joe | Ambar, Moran | Xia, Fei | Lince, Alejandro | Wang, Mingqiu | Mustafa, Basil | Webson, Albert | Lee, Hyo | Anil, Rohan | Wicke, Martin | Dozat, Timothy | Sinha, Abhishek | Piqueras, Enrique | Dabir, Elahe | Upadhyay, Shyam | Boral, Anudhyan | Hendricks, Lisa Anne | Fry, Corey | Djolonga, Josip | Su, Yi | Walker, Jake | Labanowski, Jane | Huang, Ronny | Misra, Vedant | Chen, Jeremy | Skerry-Ryan, RJ | Singh, Avi | Rijhwani, Shruti | Yu, Dian | Castro-Ros, Alex | Changpinyo, Beer | Datta, Romina | Bagri, Sumit | Hrafnkelsson, Arnar Mar | Maggioni, Marcello | Zheng, Daniel | Sulsky, Yury | Hou, Shaobo | Paine, Tom Le | Yang, Antoine | Riesa, Jason | Rogozinska, Dominika | Marcus, Dror | Badawy, Dalia El | Zhang, Qiao | Wang, Luyu | Miller, Helen | Greer, Jeremy | Sjos, Lars Lowe | Nova, Azade | Zen, Heiga | Chaabouni, Rahma | Rosca, Mihaela | Jiang, Jiepu | Chen, Charlie | Liu, Ruibo | Sainath, Tara | Krikun, Maxim | Polozov, Alex | Lespiau, Jean-Baptiste | Newlan, Josh | Cankara, Zeyncep | Kwak, Soo | Xu, Yunhan | Chen, Phil | Coenen, Andy | Meyer, Clemens | Tsihlas, Katerina | Ma, Ada | Gottweis, Juraj | Xing, Jinwei | Gu, Chenjie | Miao, Jin | Frank, Christian | Cankara, Zeynep | Ganapathy, Sanjay | Dasgupta, Ishita | Hughes-Fitt, Steph | Chen, Heng | Reid, David | Rong, Keran | Fan, Hongmin | van Amersfoort, Joost | Zhuang, Vincent | Cohen, Aaron | Gu, Shixiang Shane | Mohananey, Anhad | Ilic, Anastasija | Tobin, Taylor | Wieting, John | Bortsova, Anna | Thacker, Phoebe | Wang, Emma | Caveness, Emily | Chiu, Justin | Sezener, Eren | Kaskasoli, Alex | Baker, Steven | Millican, Katie | Elhawaty, Mohamed | Aisopos, Kostas | Lebsack, Carl | Byrd, Nathan | Dai, Hanjun | Jia, Wenhao | Wiethoff, Matthew | Davoodi, Elnaz | Weston, Albert | Yagati, Lakshman | Ahuja, Arun | Gao, Isabel | Pundak, Golan | Zhang, Susan | Azzam, Michael | Sim, Khe Chai | Caelles, Sergi | Keeling, James | Sharma, Abhanshu | Swing, Andy | Li, YaGuang | Liu, Chenxi | Bostock, Carrie Grimes | Bansal, Yamini | Nado, Zachary | Anand, Ankesh | Lipschultz, Josh | Karmarkar, Abhijit | Proleev, Lev | Ittycheriah, Abe | Yeganeh, Soheil Hassas | Polovets, George | Faust, Aleksandra | Sun, Jiao | Rrustemi, Alban | Li, Pen | Shivanna, Rakesh | Liu, Jeremiah | Welty, Chris | Lebron, Federico | Baddepudi, Anirudh | Krause, Sebastian | Parisotto, Emilio | Soricut, Radu | Xu, Zheng | Bloxwich, Dawn | Johnson, Melvin | Neyshabur, Behnam | Mao-Jones, Justin | Wang, Renshen | Ramasesh, Vinay | Abbas, Zaheer | Guez, Arthur | Segal, Constant | Nguyen, Duc Dung | Svensson, James | Hou, Le | York, Sarah | Milan, Kieran | Bridgers, Sophie | Gworek, Wiktor | Tagliasacchi, Marco | Lee-Thorp, James | Chang, Michael | Guseynov, Alexey | Hartman, Ale Jakse | Kwong, Michael | Zhao, Ruizhe | Kashem, Sheleem | Cole, Elizabeth | Miech, Antoine | Tanburn, Richard | Phuong, Mary | Pavetic, Filip | Cevey, Sebastien | Comanescu, Ramona | Ives, Richard | Yang, Sherry | Du, Cosmo | Li, Bo | Zhang, Zizhao | Iinuma, Mariko | Hu, Clara Huiyi | Roy, Aurko | Bijwadia, Shaan | Zhu, Zhenkai | Martins, Danilo | Saputro, Rachel | Gergely, Anita | Zheng, Steven | Jia, Dawei | Antonoglou, Ioannis | Sadovsky, Adam | Gu, Shane | Bi, Yingying | Andreev, Alek | Samangooei, Sina | Khan, Mina | Kocisky, Tomas | Filos, Angelos | Kumar, Chintu | Bishop, Colton | Yu, Adams | Hodkinson, Sarah | Mittal, Sid | Shah, Premal | Moufarek, Alexandre | Cheng, Yong | Bloniarz, Adam | Lee, Jaehoon | Pejman, Pedram | Michel, Paul | Spencer, Stephen | Feinberg, Vladimir | Xiong, Xuehan | Savinov, Nikolay | Smith, Charlotte | Shakeri, Siamak | Tran, Dustin | Chesus, Mary | Bohnet, Bernd | Tucker, George | von Glehn, Tamara | Muir, Carrie | Mao, Yiran | Kazawa, Hideto | Slone, Ambrose | Soparkar, Kedar | Shrivastava, Disha | Cobon-Kerr, James | Sharman, Michael | Pavagadhi, Jay | Araya, Carlos | Misiunas, Karolis | Ghelani, Nimesh | Laskin, Michael | Barker, David | Li, Qiujia | Briukhov, Anton | Houlsby, Neil | Glaese, Mia | Lakshminarayanan, Balaji | Schucher, Nathan | Tang, Yunhao | Collins, Eli | Lim, Hyeontaek | Feng, Fangxiaoyu | Recasens, Adria | Lai, Guangda | Magni, Alberto | De Cao, Nicola | Siddhant, Aditya | Ashwood, Zoe | Orbay, Jordi | Dehghani, Mostafa | Brennan, Jenny | He, Yifan | Xu, Kelvin | Gao, Yang | Saroufim, Carl | Molloy, James | Wu, Xinyi | Arnold, Seb | Chang, Solomon | Schrittwieser, Julian | Buchatskaya, Elena | Radpour, Soroush | Polacek, Martin | Giordano, Skye | Bapna, Ankur | Tokumine, Simon | Hellendoorn, Vincent | Sottiaux, Thibault | Cogan, Sarah | Severyn, Aliaksei | Saleh, Mohammad | Thakoor, Shantanu | Shefey, Laurent | Qiao, Siyuan | Gaba, Meenu | Chang, Shuo-yiin | Swanson, Craig | Zhang, Biao | Lee, Benjamin | Rubenstein, Paul Kishan | Song, Gan | Kwiatkowski, Tom | Koop, Anna | Kannan, Ajay | Kao, David | Schuh, Parker | Stjerngren, Axel | Ghiasi, Golnaz | Gibson, Gena | Vilnis, Luke | Yuan, Ye | Ferreira, Felipe Tiengo | Kamath, Aishwarya | Klimenko, Ted | Franko, Ken | Xiao, Kefan | Bhattacharya, Indro | Patel, Miteyan | Wang, Rui | Morris, Alex | Strudel, Robin | Sharma, Vivek | Choy, Peter | Hashemi, Sayed Hadi | Landon, Jessica | Finkelstein, Mara | Jhakra, Priya | Frye, Justin | Barnes, Megan | Mauger, Matthew | Daun, Dennis | Baatarsukh, Khuslen | Tung, Matthew | Farhan, Wael | Michalewski, Henryk | Viola, Fabio | Quitry, Felix de Chaumont | Lan, Charline Le | Hudson, Tom | Wang, Qingze | Fischer, Felix | Zheng, Ivy | White, Elspeth | Dragan, Anca | Alayrac, Jean-baptiste | Ni, Eric | Pritzel, Alexander | Iwanicki, Adam | Isard, Michael | Bulanova, Anna | Zilka, Lukas | Dyer, Ethan | Sachan, Devendra | Srinivasan, Srivatsan | Muckenhirn, Hannah | Cai, Honglong | Mandhane, Amol | Tariq, Mukarram | Rae, Jack W. | Wang, Gary | Ayoub, Kareem | FitzGerald, Nicholas | Zhao, Yao | Han, Woohyun | Alberti, Chris | Garrette, Dan | Krishnakumar, Kashyap | Gimenez, Mai | Levskaya, Anselm | Sohn, Daniel | Matak, Josip | Iturrate, Inaki | Chang, Michael B. | Xiang, Jackie | Cao, Yuan | Ranka, Nishant | Brown, Geoff | Hutter, Adrian | Mirrokni, Vahab | Chen, Nanxin | Yao, Kaisheng | Egyed, Zoltan | Galilee, Francois | Liechty, Tyler | Kallakuri, Praveen | Palmer, Evan | Ghemawat, Sanjay | Liu, Jasmine | Tao, David | Thornton, Chloe | Green, Tim | Jasarevic, Mimi | Lin, Sharon | Cotruta, Victor | Tan, Yi-Xuan | Fiedel, Noah | Yu, Hongkun | Chi, Ed | Neitz, Alexander | Heitkaemper, Jens | Sinha, Anu | Zhou, Denny | Sun, Yi | Kaed, Charbel | Hulse, Brice | Mishra, Swaroop | Georgaki, Maria | Kudugunta, Sneha | Farabet, Clement | Shafran, Izhak | Vlasic, Daniel | Tsitsulin, Anton | Ananthanarayanan, Rajagopal | Carin, Alen | Su, Guolong | Sun, Pei | V, Shashank | Carvajal, Gabriel | Broder, Josef | Comsa, Iulia | Repina, Alena | Wong, William | Chen, Warren Weilun | Hawkins, Peter | Filonov, Egor | Loher, Lucia | Hirnschall, Christoph | Wang, Weiyi | Ye, Jingchen | Burns, Andrea | Cate, Hardie | Wright, Diana Gage | Piccinini, Federico | Zhang, Lei | Lin, Chu-Cheng | Gog, Ionel | Kulizhskaya, Yana | Sreevatsa, Ashwin | Song, Shuang | Cobo, Luis C. | Iyer, Anand | Tekur, Chetan | Garrido, Guillermo | Xiao, Zhuyun | Kemp, Rupert | Zheng, Huaixiu Steven | Li, Hui | Agarwal, Ananth | Ngani, Christel | Goshvadi, Kati | Santamaria-Fernandez, Rebeca | Fica, Wojciech | Chen, Xinyun | Gorgolewski, Chris | Sun, Sean | Garg, Roopal | Ye, Xinyu | Eslami, S. M. Ali | Hua, Nan | Simon, Jon | Joshi, Pratik | Kim, Yelin | Tenney, Ian | Potluri, Sahitya | Thiet, Lam Nguyen | Yuan, Quan | Luisier, Florian | Chronopoulou, Alexandra | Scellato, Salvatore | Srinivasan, Praveen | Chen, Minmin | Koverkathu, Vinod | Dalibard, Valentin | Xu, Yaming | Saeta, Brennan | Anderson, Keith | Sellam, Thibault | Fernando, Nick | Huot, Fantine | Jung, Junehyuk | Varadarajan, Mani | Quinn, Michael | Raul, Amit | Le, Maigo | Habalov, Ruslan | Clark, Jon | Jalan, Komal | Bullard, Kalesha | Singhal, Achintya | Luong, Thang | Wang, Boyu | Rajayogam, Sujeevan | Eisenschlos, Julian | Jia, Johnson | Finchelstein, Daniel | Yakubovich, Alex | Balle, Daniel | Fink, Michael | Agarwal, Sameer | Li, Jing | Dvijotham, Dj | Pal, Shalini | Kang, Kai | Konzelmann, Jaclyn | Beattie, Jennifer | Dousse, Olivier | Wu, Diane | Crocker, Remi | Elkind, Chen | Jonnalagadda, Siddhartha Reddy | Lee, Jong | Holtmann-Rice, Dan | Kallarackal, Krystal | Liu, Rosanne | Vnukov, Denis | Vats, Neera | Invernizzi, Luca | Jafari, Mohsen | Zhou, Huanjie | Taylor, Lilly | Prendki, Jennifer | Wu, Marcus | Eccles, Tom | Liu, Tianqi | Kopparapu, Kavya | Beaufays, Francoise | Angermueller, Christof | Marzoca, Andreea | Sarcar, Shourya | Dib, Hilal | Stanway, Jeff | Perbet, Frank | Trdin, Nejc | Sterneck, Rachel | Khorlin, Andrey | Li, Dinghua | Wu, Xihui | Goenka, Sonam | Madras, David | Goldshtein, Sasha | Gierke, Willi | Zhou, Tong | Liu, Yaxin | Liang, Yannie | White, Anais | Li, Yunjie | Singh, Shreya | Bahargam, Sanaz | Epstein, Mark | Basu, Sujoy | Lao, Li | Ozturel, Adnan | Crous, Carl | Zhai, Alex | Lu, Han | Tung, Zora | Gaur, Neeraj | Walton, Alanna | Dixon, Lucas | Zhang, Ming | Globerson, Amir | Uy, Grant | Bolt, Andrew | Wiles, Olivia | Nasr, Milad | Shumailov, Ilia | Selvi, Marco | Piccinno, Francesco | Aguilar, Ricardo | McCarthy, Sara | Khalman, Misha | Shukla, Mrinal | Galic, Vlado | Carpenter, John | Villela, Kevin | Zhang, Haibin | Richardson, Harry | Martens, James | Bosnjak, Matko | Belle, Shreyas Rammohan | Seibert, Jeff | Alnahlawi, Mahmoud | McWilliams, Brian | Singh, Sankalp | Louis, Annie | Ding, Wen | Popovici, Dan | Simicich, Lenin | Knight, Laura | Mehta, Pulkit | Gupta, Nishesh | Shi, Chongyang | Fatehi, Saaber | Mitrovic, Jovana | Grills, Alex | Pagadora, Joseph | Petrova, Dessie | Eisenbud, Danielle | Zhang, Zhishuai | Yates, Damion | Mittal, Bhavishya | Tripuraneni, Nilesh | Assael, Yannis | Brovelli, Thomas | Jain, Prateek | Velimirovic, Mihajlo | Akbulut, Canfer | Mu, Jiaqi | Macherey, Wolfgang | Kumar, Ravin | Xu, Jun | Qureshi, Haroon | Comanici, Gheorghe | Wiesner, Jeremy | Gong, Zhitao | Ruddock, Anton | Bauer, Matthias | Felt, Nick | GP, Anirudh | Arnab, Anurag | Zelle, Dustin | Rothfuss, Jonas | Rosgen, Bill | Shenoy, Ashish | Seybold, Bryan | Li, Xinjian | Mudigonda, Jayaram | Erdogan, Goker | Xia, Jiawei | Simsa, Jiri | Michi, Andrea | Yao, Yi | Yew, Christopher | Kan, Steven | Caswell, Isaac | Radebaugh, Carey | Elisseeff, Andre | Valenzuela, Pedro | McKinney, Kay | Paterson, Kim | Cui, Albert | Latorre-Chimoto, Eri | Kim, Solomon | Zeng, William | Durden, Ken | Ponnapalli, Priya | Sosea, Tiberiu | Choquette-Choo, Christopher A. | Manyika, James | Robenek, Brona | Vashisht, Harsha | Pereira, Sebastien | Lam, Hoi | Velic, Marko | Owusu-Afriyie, Denese | Lee, Katherine | Bolukbasi, Tolga | Parrish, Alicia | Lu, Shawn | Park, Jane | Venkatraman, Balaji | Talbert, Alice | Rosique, Lambert | Cheng, Yuchung | Sozanschi, Andrei | Paszke, Adam | Kumar, Praveen | Austin, Jessica | Li, Lu | Salama, Khalid | Kim, Wooyeol | Dukkipati, Nandita | Baryshnikov, Anthony | Kaplanis, Christos | Sheng, XiangHai | Chervonyi, Yuri | Unlu, Caglar | Casas, Diego de Las | Askham, Harry | Tunyasuvunakool, Kathryn | Gimeno, Felix | Poder, Siim | Kwak, Chester | Miecnikowski, Matt | Dimitriev, Alek | Parisi, Aaron | Liu, Dangyi | Tsai, Tomy | Shevlane, Toby | Kouridi, Christina | Garmon, Drew | Goedeckemeyer, Adrian | Brown, Adam R. | Vijayakumar, Anitha | Elqursh, Ali | Jazayeri, Sadegh | Huang, Jin | Carthy, Sara Mc | Hoover, Jay | Kim, Lucy | Kumar, Sandeep | Chen, Wei | Biles, Courtney | Bingham, Garrett | Rosen, Evan | Wang, Lisa | Tan, Qijun | Engel, David | Pongetti, Francesco | de Cesare, Dario | Hwang, Dongseong | Yu, Lily | Pullman, Jennifer | Narayanan, Srini | Levin, Kyle | Gopal, Siddharth | Li, Megan | Aharoni, Asaf | Trinh, Trieu | Lo, Jessica | Casagrande, Norman | Vij, Roopali | Matthey, Loic | Ramadhana, Bramandia | Matthews, Austin | Carey, CJ | Johnson, Matthew | Goranova, Kremena | Shah, Rohin | Ashraf, Shereen | Dasgupta, Kingshuk | Larsen, Rasmus | Wang, Yicheng | Vuyyuru, Manish Reddy | Jiang, Chong | Ijazi, Joana | Osawa, Kazuki | Smith, Celine | Boppana, Ramya Sree | Bilal, Taylan | Koizumi, Yuma | Xu, Ying | Altun, Yasemin | Shabat, Nir | Bariach, Ben | Korchemniy, Alex | Choo, Kiam | Ronneberger, Olaf | Iwuanyanwu, Chimezie | Zhao, Shubin | Soergel, David | Hsieh, Cho-Jui | Cai, Irene | Iqbal, Shariq | Sundermeyer, Martin | Chen, Zhe | Bursztein, Elie | Malaviya, Chaitanya | Biadsy, Fadi | Shroff, Prakash | Dhillon, Inderjit | Latkar, Tejasi | Dyer, Chris | Forbes, Hannah | Nicosia, Massimo | Nikolaev, Vitaly | Greene, Somer | Georgiev, Marin | Wang, Pidong | Martin, Nina | Sedghi, Hanie | Zhang, John | Banzal, Praseem | Fritz, Doug | Rao, Vikram | Wang, Xuezhi | Zhang, Jiageng | Patraucean, Viorica | Du, Dayou | Mordatch, Igor | Jurin, Ivan | Liu, Lewis | Dubey, Ayush | Mohan, Abhi | Nowakowski, Janek | Ion, Vlad-Doru | Wei, Nan | Tojo, Reiko | Raad, Maria Abi | Hudson, Drew A. | Keshava, Vaishakh | Agrawal, Shubham | Ramirez, Kevin | Wu, Zhichun | Nguyen, Hoang | Liu, Ji | Sewak, Madhavi | Petrini, Bryce | Choi, DongHyun | Philips, Ivan | Wang, Ziyue | Bica, Ioana | Garg, Ankush | Wilkiewicz, Jarek | Agrawal, Priyanka | Guo, Danhao | Xue, Emily | Shaik, Naseer | Leach, Andrew | Khan, Sadh MNM | Wiesinger, Julia | Jerome, Sammy | Chakladar, Abhishek | Wang, Alek Wenjiao | Ornduff, Tina | Abu, Folake | Ghaffarkhah, Alireza | Wainwright, Marcus | Cortes, Mario | Liu, Frederick | Maynez, Joshua | Terzis, Andreas | Samangouei, Pouya | Mansour, Riham | Kępa, Tomasz | Aubet, François-Xavier | Algymr, Anton | Banica, Dan | Weisz, Agoston | Orban, Andras | Senges, Alexandre | Andrejczuk, Ewa | Geller, Mark | Santo, Niccolo Dal | Anklin, Valentin | Merey, Majd Al | Baeuml, Martin | Strohman, Trevor | Bai, Junwen | Petrov, Slav | Wu, Yonghui | Hassabis, Demis | Kavukcuoglu, Koray | Dean, Jeffrey | Vinyals, Oriol

Mar 8, 2024 – Gemini 1.5 introduces highly efficient multimodal models capable of recalling and reasoning over extensive context, surpassing previous versions and achieving near-perfect performance in tasks like long-document QA and long-video QA. The models demonstrate significant advancements in long-context understanding, with real-world applications showcasing substantial time savings and the ability to learn and translate rare languages effectively.

Code Llama: Open Foundation Models for Code

PageRank: 2,953
Growth: +1,218%
Citations: 1,333

Rozière, Baptiste | Gehring, Jonas | Gloeckle, Fabian | Sootla, Sten | Gat, Itai | Tan, Xiaoqing Ellen | Adi, Yossi | Liu, Jingyu | Sauvestre, Romain | Remez, Tal | Rapin, Jérémy | Kozhevnikov, Artyom | Evtimov, Ivan | Bitton, Joanna | Bhatt, Manish | Ferrer, Cristian Canton | Grattafiori, Aaron | Xiong, Wenhan | Défossez, Alexandre | Copet, Jade | Azhar, Faisal | Touvron, Hugo | Martin, Louis | Usunier, Nicolas | Scialom, Thomas | Synnaeve, Gabriel

Aug 24, 2023 – Code Llama is a family of large language models for code that provide state-of-the-art performance, infilling capabilities, and support for large input contexts. The models are available in different flavors and have been trained on sequences of 16k tokens, showing improvements on inputs with up to 100k tokens.

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

PageRank: 2,584
Growth: +1,155%
Citations: 1,817

Rafailov, Rafael | Sharma, Archit | Mitchell, Eric | Ermon, Stefano | Manning, Christopher D. | Finn, Chelsea

May 29, 2023 – This paper introduces a new method called Direct Preference Optimization (DPO) for fine-tuning large unsupervised language models (LMs) to align with human preferences. DPO is a stable and computationally lightweight algorithm that eliminates the need for complex reinforcement learning procedures, and it performs as well as or better than existing methods in controlling sentiment and improving response quality in language generation tasks.

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

PageRank: 10,618
Growth: +1,126%
Citations: 536

Abdin, Marah | Aneja, Jyoti | Awadalla, Hany | Awadallah, Ahmed | Awan, Ammar Ahmad | Bach, Nguyen | Bahree, Amit | Bakhtiari, Arash | Bao, Jianmin | Behl, Harkirat | Benhaim, Alon | Bilenko, Misha | Bjorck, Johan | Bubeck, Sébastien | Cai, Martin | Cai, Qin | Chaudhary, Vishrav | Chen, Dong | Chen, Dongdong | Chen, Weizhu | Chen, Yen-Chun | Chen, Yi-Ling | Cheng, Hao | Chopra, Parul | Dai, Xiyang | Dixon, Matthew | Eldan, Ronen | Fragoso, Victor | Gao, Jianfeng | Gao, Mei | Gao, Min | Garg, Amit | Del Giorno, Allie | Goswami, Abhishek | Gunasekar, Suriya | Haider, Emman | Hao, Junheng | Hewett, Russell J. | Hu, Wenxiang | Huynh, Jamie | Iter, Dan | Jacobs, Sam Ade | Javaheripi, Mojan | Jin, Xin | Karampatziakis, Nikos | Kauffmann, Piero | Khademi, Mahoud | Kim, Dongwoo | Kim, Young Jin | Kurilenko, Lev | Lee, James R. | Lee, Yin Tat | Li, Yuanzhi | Li, Yunsheng | Liang, Chen | Liden, Lars | Lin, Xihui | Lin, Zeqi | Liu, Ce | Liu, Liyuan | Liu, Mengchen | Liu, Weishung | Liu, Xiaodong | Luo, Chong | Madan, Piyush | Mahmoudzadeh, Ali | Majercak, David | Mazzola, Matt | Mendes, Caio César Teodoro | Mitra, Arindam | Modi, Hardik | Nguyen, Anh | Norick, Brandon | Patra, Barun | Perez-Becker, Daniel | Portet, Thomas | Pryzant, Reid | Qin, Heyang | Radmilac, Marko | Ren, Liliang | de Rosa, Gustavo | Rosset, Corby | Roy, Sambudha | Ruwase, Olatunji | Saarikivi, Olli | Saied, Amin | Salim, Adil | Santacroce, Michael | Shah, Shital | Shang, Ning | Sharma, Hiteshi | Shen, Yelong | Shukla, Swadheen | Song, Xia | Tanaka, Masahiro | Tupini, Andrea | Vaddamanu, Praneetha | Wang, Chunyu | Wang, Guanhua | Wang, Lijuan | Wang, Shuohang | Wang, Xin | Wang, Yu | Ward, Rachel | Wen, Wen | Witte, Philipp | Wu, Haiping | Wu, Xiaoxia | Wyatt, Michael | Xiao, Bin | Xu, Can | Xu, Jiahang | Xu, Weijian | Xue, Jilong | Yadav, Sonali | Yang, Fan | Yang, Jianwei | Yang, Yifan | Yang, Ziyi | Yu, Donghan | Yuan, Lu | Zhang, Chenruidong | Zhang, Cyril | Zhang, Jianwen | Zhang, Li Lyna | Zhang, Yi | Zhang, Yue | Zhang, Yunan | Zhou, Xiren

Apr 22, 2024 – The Phi-3 Technical Report introduces phi-3-mini, a highly capable language model with 3.8 billion parameters trained on 3.3 trillion tokens, achieving performance comparable to larger models like Mixtral 8x7B and GPT-3.5. Additionally, the report discusses the development of more advanced models in the phi-3 series, such as phi-3-small and phi-3-medium, as well as phi-3.5 models focusing on multilingual, multimodal, and long-context capabilities.

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

PageRank: 9,404
Growth: +1,007%
Citations: 549

Huang, Lei | Yu, Weijiang | Ma, Weitao | Zhong, Weihong | Feng, Zhangyin | Wang, Haotian | Chen, Qianglong | Peng, Weihua | Feng, Xiaocheng | Qin, Bing | Liu, Ting

Nov 9, 2023 – This survey explores the issue of hallucinations in large language models (LLMs), which produce content inconsistent with real-world facts or user inputs. The survey provides a taxonomy of LLM hallucinations, discusses factors contributing to hallucinations, presents detection methods and benchmarks, introduces approaches to mitigate hallucinations, and highlights challenges and open questions for future research.

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

PageRank: 9,775
Growth: +925%
Citations: 471

Wu, Qingyun | Bansal, Gagan | Zhang, Jieyu | Wu, Yiran | Li, Beibin | Zhu, Erkang | Jiang, Li | Zhang, Xiaoyun | Zhang, Shaokun | Liu, Jiale | Awadallah, Ahmed Hassan | White, Ryen W | Burger, Doug | Wang, Chi

Aug 16, 2023 – AutoGen is an open-source framework that allows developers to create LLM applications using multiple conversational agents. These agents can be customized and operate in different modes, and developers can define their interaction behaviors using natural language and code. The framework has been proven effective in various applications, including mathematics, coding, question answering, and entertainment.

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

PageRank: 5,553
Growth: +917%
Citations: 1,158

Podell, Dustin | English, Zion | Lacey, Kyle | Blattmann, Andreas | Dockhorn, Tim | Müller, Jonas | Penna, Joe | Rombach, Robin

Jul 4, 2023 – The researchers present SDXL, a latent diffusion model for text-to-image synthesis that improves upon previous versions by using a larger UNet backbone and introducing multiple novel conditioning schemes. They also introduce a refinement model to enhance the visual fidelity of generated samples and demonstrate that SDXL achieves competitive results with state-of-the-art image generators.

Zephyr: Direct Distillation of LM Alignment

PageRank: 10,869
Growth: +903%
Citations: 465

Tunstall, Lewis | Beeching, Edward | Lambert, Nathan | Rajani, Nazneen | Rasul, Kashif | Belkada, Younes | Huang, Shengyi | von Werra, Leandro | Fourrier, Clémentine | Habib, Nathan | Sarrazin, Nathan | Sanseviero, Omar | Rush, Alexander M. | Wolf, Thomas

Oct 25, 2023 – The researchers developed a language model called Zephyr-7B that is aligned to user intent by using preference data from AI Feedback. Zephyr-7B outperforms other models on chat benchmarks and does not require human annotation.

Llama 2: Open Foundation and Fine-Tuned Chat Models

PageRank: 413
Growth: +892%
Citations: 8,622

Touvron, Hugo | Martin, Louis | Stone, Kevin | Albert, Peter | Almahairi, Amjad | Babaei, Yasmine | Bashlykov, Nikolay | Batra, Soumya | Bhargava, Prajjwal | Bhosale, Shruti | Bikel, Dan | Blecher, Lukas | Ferrer, Cristian Canton | Chen, Moya | Cucurull, Guillem | Esiobu, David | Fernandes, Jude | Fu, Jeremy | Fu, Wenyin | Fuller, Brian | Gao, Cynthia | Goswami, Vedanuj | Goyal, Naman | Hartshorn, Anthony | Hosseini, Saghar | Hou, Rui | Inan, Hakan | Kardas, Marcin | Kerkez, Viktor | Khabsa, Madian | Kloumann, Isabel | Korenev, Artem | Koura, Punit Singh | Lachaux, Marie-Anne | Lavril, Thibaut | Lee, Jenya | Liskovich, Diana | Lu, Yinghai | Mao, Yuning | Martinet, Xavier | Mihaylov, Todor | Mishra, Pushkar | Molybog, Igor | Nie, Yixin | Poulton, Andrew | Reizenstein, Jeremy | Rungta, Rashi | Saladi, Kalyan | Schelten, Alan | Silva, Ruan | Smith, Eric Michael | Subramanian, Ranjan | Tan, Xiaoqing Ellen | Tang, Binh | Taylor, Ross | Williams, Adina | Kuan, Jian Xiang | Xu, Puxin | Yan, Zheng | Zarov, Iliyan | Zhang, Yuchen | Fan, Angela | Kambadur, Melanie | Narang, Sharan | Rodriguez, Aurelien | Stojnic, Robert | Edunov, Sergey | Scialom, Thomas

Jul 18, 2023 – The researchers have developed and released Llama 2, a collection of large language models optimized for dialogue use cases. These models outperform open-source chat models and are considered a suitable alternative to closed-source models, with a focus on safety and responsible development.

Qwen2 Technical Report

PageRank: 13,417
Growth: +887%
Citations: 460

Yang, An | Yang, Baosong | Hui, Binyuan | Zheng, Bo | Yu, Bowen | Zhou, Chang | Li, Chengpeng | Li, Chengyuan | Liu, Dayiheng | Huang, Fei | Dong, Guanting | Wei, Haoran | Lin, Huan | Tang, Jialong | Wang, Jialin | Yang, Jian | Tu, Jianhong | Zhang, Jianwei | Ma, Jianxin | Yang, Jianxin | Xu, Jin | Zhou, Jingren | Bai, Jinze | He, Jinzheng | Lin, Junyang | Dang, Kai | Lu, Keming | Chen, Keqin | Yang, Kexin | Li, Mei | Xue, Mingfeng | Ni, Na | Zhang, Pei | Wang, Peng | Peng, Ru | Men, Rui | Gao, Ruize | Lin, Runji | Wang, Shijie | Bai, Shuai | Tan, Sinan | Zhu, Tianhang | Li, Tianhao | Liu, Tianyu | Ge, Wenbin | Deng, Xiaodong | Zhou, Xiaohuan | Ren, Xingzhang | Zhang, Xinyu | Wei, Xipin | Ren, Xuancheng | Liu, Xuejing | Fan, Yang | Yao, Yang | Zhang, Yichang | Wan, Yu | Chu, Yunfei | Liu, Yuqiong | Cui, Zeyu | Zhang, Zhenru | Guo, Zhifang | Fan, Zhihao

Jul 15, 2024 – The Qwen2 series introduces advanced language and multimodal models with a wide parameter range, surpassing previous models in performance across various benchmarks. The flagship model, Qwen2-72B, demonstrates exceptional performance in language understanding, generation, and multilingual proficiency, with model weights openly available for community innovation and accessibility.

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

PageRank: 14,168
Growth: +882%
Citations: 396

Zhu, Lianghui | Liao, Bencheng | Zhang, Qian | Wang, Xinlong | Liu, Wenyu | Wang, Xinggang

Jan 17, 2024 – The paper introduces a new vision backbone called Vim, which uses bidirectional state space models instead of self-attention for visual representation learning. Vim achieves higher performance and improved computation and memory efficiency compared to existing vision transformers like DeiT, making it a promising backbone for vision foundation models.

A Mathematical Theory of Semantic Communication

PageRank: 5,309
Growth: +838%
Citations: 416

Niu, Kai | Zhang, Ping

Jan 24, 2024 – The paper introduces a new framework called semantic information theory (SIT) to explore semantic communication, focusing on synonymous mapping between semantic and syntactic information. It establishes measures such as semantic entropy, semantic mutual information, semantic capacity, and semantic rate-distortion function, proving coding theorems and extending the limits of SIT using synonymous mapping. Additionally, it discusses semantic information measures in the continuous case and derives a new channel capacity formula for the band-limited Gaussian channel.

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints

PageRank: 9,213
Growth: +834%
Citations: 322

Ainslie, Joshua | Lee-Thorp, James | de Jong, Michiel | Zemlyanskiy, Yury | Lebrón, Federico | Sanghai, Sumit

May 22, 2023 – The researchers propose a method to train existing multi-head language models to have multi-query attention (MQA) using minimal computing resources. They also introduce grouped-query attention (GQA), a variation of MQA that achieves similar quality to multi-head attention while maintaining comparable speed to MQA.

Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

PageRank: 14,652
Growth: +810%
Citations: 473

Bai, Jinze | Bai, Shuai | Yang, Shusheng | Wang, Shijie | Tan, Sinan | Wang, Peng | Lin, Junyang | Zhou, Chang | Zhou, Jingren

Aug 24, 2023 – The Qwen-VL series is a set of large-scale vision-language models designed to understand both texts and images. These models achieve state-of-the-art performance on various visual-centric benchmarks and outperform existing vision-language chatbots on real-world dialog benchmarks.

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

PageRank: 16,435
Growth: +796%
Citations: 354

Guo, Daya | Zhu, Qihao | Yang, Dejian | Xie, Zhenda | Dong, Kai | Zhang, Wentao | Chen, Guanting | Bi, Xiao | Wu, Y. | Li, Y. K. | Luo, Fuli | Xiong, Yingfei | Liang, Wenfeng

Jan 25, 2024 – The DeepSeek-Coder series is a collection of open-source code models that have been trained on a large code corpus. These models outperform existing closed-source models and can be used for both research and commercial purposes.

VMamba: Visual State Space Model

PageRank: 17,407
Growth: +748%
Citations: 313

Liu, Yue | Tian, Yunjie | Zhao, Yuzhong | Yu, Hongtian | Xie, Lingxi | Wang, Yaowei | Ye, Qixiang | Liu, Yunfan

Jan 18, 2024 – The paper introduces VMamba, a vision backbone model that incorporates the Mamba state-space language model for efficient network architecture design in computer vision tasks. VMamba utilizes Visual State-Space (VSS) blocks with the 2D Selective Scan (SS2D) module to gather contextual information from various sources and perspectives, demonstrating promising performance and input scaling efficiency in diverse visual perception tasks.

Jailbreaking Black Box Large Language Models in Twenty Queries

PageRank: 16,424
Growth: +748%
Citations: 356

Chao, Patrick | Robey, Alexander | Dobriban, Edgar | Hassani, Hamed | Pappas, George J. | Wong, Eric

Oct 12, 2023 – The study introduces Prompt Automatic Iterative Refinement (PAIR), an algorithm that can generate semantic jailbreaks for large language models (LLMs) with only black-box access, inspired by social engineering attacks. PAIR is efficient, often requiring fewer than twenty queries to produce a jailbreak, and demonstrates competitive success rates on various LLMs, including GPT-3.5/4, Vicuna, and Gemini.

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

PageRank: 18,576
Growth: +740%
Citations: 274

Chiang, Wei-Lin | Zheng, Lianmin | Sheng, Ying | Angelopoulos, Anastasios Nikolas | Li, Tianle | Li, Dacheng | Zhang, Hao | Zhu, Banghua | Jordan, Michael | Gonzalez, Joseph E. | Stoica, Ion

Mar 6, 2024 – Chatbot Arena is an open platform designed to evaluate Large Language Models (LLMs) based on human preferences using a pairwise comparison approach and crowdsourced input. The platform has garnered over 240K votes, demonstrating its credibility and value as a widely referenced LLM leaderboard for developers and companies.

Retrieval-Augmented Generation for Large Language Models: A Survey

PageRank: 7,238
Growth: +736%
Citations: 632

Gao, Yunfan | Xiong, Yun | Gao, Xinyu | Jia, Kangxiang | Pan, Jinliu | Bi, Yuxi | Dai, Yi | Sun, Jiawei | Wang, Meng | Wang, Haofen

Dec 18, 2023 – The survey paper explores Retrieval-Augmented Generation (RAG) as a solution to challenges faced by Large Language Models (LLMs) by incorporating knowledge from external databases to enhance accuracy and credibility, particularly for knowledge-intensive tasks. It reviews the progression of RAG paradigms, including Naive RAG, Advanced RAG, and Modular RAG, while highlighting the state-of-the-art technologies and proposing future research directions and evaluation frameworks.

Baichuan 2: Open Large-scale Language Models

PageRank: 10,867
Growth: +731%
Citations: 487

Yang, Aiyuan | Xiao, Bin | Wang, Bingning | Zhang, Borong | Bian, Ce | Yin, Chao | Lv, Chenxu | Pan, Da | Wang, Dian | Yan, Dong | Yang, Fan | Deng, Fei | Wang, Feng | Liu, Feng | Ai, Guangwei | Dong, Guosheng | Zhao, Haizhou | Xu, Hang | Sun, Haoze | Zhang, Hongda | Liu, Hui | Ji, Jiaming | Xie, Jian | Dai, JunTao | Fang, Kun | Su, Lei | Song, Liang | Liu, Lifeng | Ru, Liyun | Ma, Luyao | Wang, Mang | Liu, Mickel | Lin, MingAn | Nie, Nuolan | Guo, Peidong | Sun, Ruiyang | Zhang, Tao | Li, Tianpeng | Li, Tianyu | Cheng, Wei | Chen, Weipeng | Zeng, Xiangrong | Wang, Xiaochuan | Chen, Xiaoxi | Men, Xin | Yu, Xin | Pan, Xuehai | Shen, Yanjun | Wang, Yiding | Li, Yiyu | Jiang, Youxin | Gao, Yuchen | Zhang, Yupeng | Zhou, Zenan | Wu, Zhiying

Sep 19, 2023 – Baichuan 2 is a series of large-scale multilingual language models that have been trained from scratch and contain billions of parameters. These models outperform other open-source models of similar size on various benchmarks and excel in vertical domains like medicine and law. The pre-training model checkpoints will be released to benefit the research community.

Loading papers...