Looking into Black Box Code Language Models

arXiv:2407.04868v1 Announce Type: new
Abstract: Language models (LMs) have proven useful for code-related tasks, and several code LMs have been proposed recently. Most studies in this direction focus only on improving LM performance on various benchmarks and treat the models as black boxes. A handful of works attempt to understand the role of attention layers in code LMs. Feed-forward layers, however, remain under-explored, even though they account for two-thirds of a typical transformer model’s parameters.
In this work, we attempt to gain insights into the inner workings of code language models by examining their feed-forward layers. For our investigations, we use two state-of-the-art code LMs, CodeGen-Mono and PolyCoder, and three widely used programming languages: Java, Go, and Python. We examine how stored concepts are organized, whether these concepts can be edited, and how different layers and variations in input context size affect output generation. Our empirical findings demonstrate that lower layers capture syntactic patterns while higher layers encode abstract concepts and semantics. We show that concepts of interest can be edited within feed-forward layers without compromising code LM performance. Additionally, we observe that initial layers serve as “thinking” layers, while later layers are crucial for predicting subsequent code tokens. Furthermore, we find that earlier layers can make accurate predictions for smaller contexts, but larger contexts require critical contributions from later layers. We anticipate that these findings will facilitate better understanding, debugging, and testing of code LMs.
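
The "two-thirds" figure in the abstract can be checked with a rough parameter count for a single transformer block. The sketch below is not taken from the paper. It assumes a standard block in which attention uses four d_model x d_model projections (query, key, value, output) and the feed-forward layer uses two projections with a hidden size of 4 x d_model, and it ignores biases, layer norms, and embeddings. The function name and its parameters are hypothetical, chosen only for illustration.

```python
def block_param_counts(d_model: int, ffn_mult: int = 4) -> dict:
    """Approximate weight counts for one transformer block.

    Assumptions (not from the paper): four d_model x d_model attention
    projections, a feed-forward hidden size of ffn_mult * d_model, and
    no biases, layer norms, or embeddings.
    """
    # Query, key, value, and output projections.
    attention = 4 * d_model * d_model
    # Up-projection (d_model -> hidden) plus down-projection (hidden -> d_model).
    feed_forward = 2 * d_model * (ffn_mult * d_model)
    return {
        "attention": attention,
        "feed_forward": feed_forward,
        "ffn_fraction": feed_forward / (attention + feed_forward),
    }


if __name__ == "__main__":
    counts = block_param_counts(d_model=1024)
    print(counts["ffn_fraction"])
```

Under these assumptions the feed-forward share is 8 / (4 + 8) = 2/3 regardless of d_model. Models that use a different hidden-size multiplier, or whose embedding tables are large relative to the blocks, will deviate from this ratio.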
