SCALING LANGUAGE MODELS TO NEW HEIGHTS: A DEEP DIVE INTO MAJOR MODELS