Identifying dead code in a large code repository

I have a large C code base with >100 binaries, >3000 files and >30 libraries. A lot of dead code has accumulated, and I'm looking for ways to identify and remove it. The code is simple: no complex macros and very little automatically generated code (lex/bison/...).

To identify "static" dead code (and variables), gcc does a good job: the -Wunused-* options identify all unused static variables, static functions, and so on. My challenge is with non-static global functions and variables (and the code base has a lot of them!).

I've gotten a lot of mileage out of running 'nm' across all the object files to build a list of all defined global symbols (types 'T', 'D' and 'B' for code, data and uninitialized data). I then removed every symbol that also appears as 'U' (undefined, i.e. referenced) somewhere. That process identified all unreferenced globals. At this point, I have to manually make each such symbol static, compile with gcc -Werror -Wunused, and see whether that raises any error.

# Omitting some details for brevity.
nm --undefined-only lib1.a lib2.a ... obj1.o obj2.o obj3.o | sort > refs.txt
nm --extern-only --defined-only lib1.a lib2.a ... obj1.o obj2.o obj3.o | sort > defs.txt
join -1 2 -2 3 -v 2 refs.txt defs.txt

My question: is it possible to use "nm" (or another object-analysis tool such as objdump) to identify which global symbols in an object file are also used inside that same object? This would speed up dead-code elimination by separating truly dead global functions from global functions that are actually used (but could become static).

Alternatively, is there an existing tool that will do the job?


Asked by dash-o; edited by mkrieger1.
  • Make all the functions static and see what breaks :) – ikegami
  • Can you run your code under a source coverage analysis tool? It cannot tell you which code is really dead, but it can at least tell you which code is not dead, so you no longer need to change those symbols to static and test. Depending on your use case, it may or may not save a lot of time. – Weijun Zhou
  • You should also consider looking for duplicate code. String functions in particular tend to accumulate on big projects. – stark
  • As a lateral solution, consider using a test coverage tool (most professional environments have one available to their programmers/testers/QA anyway). Instead of looking for unused code, you might just identify untested code. If you find something that is not tested and there is no way to test it, then it is either dead or a design problem that needs fixing. (Depending on your context you might consider this an answer... let me know and I will make it one.) – Yunnosch
  • A perl/python script may be able to find most cases: a regexp to find definitions and another to find uses. Note: assume reasonably human-written code (following some coding style), do not try to build a tool that can handle all C cases (e.g. function definitions on the same line as other statements), and forget the preprocessor. Otherwise you need a parser, and it's no longer a quick script. – Giacomo Catenazzi

1 Answer


I suggest using GNU ld's dead-section removal functionality for this.

For this you need to compile your code with -fdata-sections -ffunction-sections (so each function and variable gets its own section) and then link with the -Wl,--gc-sections -Wl,--print-gc-sections flags. The linker will then print information about the sections, and hence the functions, it has removed.

Here is an example of its output for a sample program:

/usr/bin/ld: removing unused section '.text.foo' in file '/tmp/ccXZWJ2X.o'

(.text.foo is the section generated for the unused function foo.)

As a side note, if you use these options there may be no need to manually sanitize your codebase (apart from making it cleaner to read), because the toolchain will remove the dead code automatically.
