Android高手进阶——Android ANR解决案例(内部资料)

   

    当发生ANR后,首先需要查看log信息以及trace文件(系统都会在/data/anr/目录下生成trace文件)分析出ANR原因。通过以下分析并不能解决所有碰到的ANR,但程序自身原因导致的ANR问题基本都能找到原因。

  分析trace文件与Log文件,没有什么特别的技巧可言,更多的是经验积累。如果一个人无法准确的看出来具体原因,可以贴出来大家一起看看。因为,闭门造车是程序员的通病。


log信息分析


I/InputDispatcher( 220): Application is not responding:Window{2b263310com.android.email/com.android.email.activity.SplitScreenActivitypaused=false}. 5009.8ms since event, 5009.5ms since waitstarted --ANR时所处的界面
 I/WindowManager( 220): Input event dispatching timedout sendingtocom.android.email/com.android.email.activity.SplitScreenActivity --可看出ANR的类型 (KeyDispatchTimeout(5 seconds)、BroadcastTimeout(10 seconds)、ServiceTimeout(20 seconds) )
I/Process( 220): Sending signal. PID: 21404 SIG: 3---生成trace.txt的时间
I/dalvikvm(21404):threadid=4: reacting to signal 3
……
E/ActivityManager( 220): ANR in com.android.email(com.android.email/.activity.SplitScreenActivity)
E/ActivityManager( 220): Reason:keyDispatchingTimedOut
E/ActivityManager( 220): Load: 8.68 / 8.37 / 8.53
E/ActivityManager( 220):CPUusage from 4361ms to 699ms ago----CPU在ANR发生前的使用情况
E/ActivityManager( 220): 5.5!404/com.android.email: 1.3% user + 4.1% kernel / faults: 10 minor
E/ActivityManager( 220): 4.3"0/system_server: 2.7% user + 1.5% kernel / faults: 11 minor 2 major
E/ActivityManager( 220): 0.9R/spi_qsd.0: 0% user + 0.9% kernel
E/ActivityManager( 220): 0.5e/irq/170-cyttsp-: 0% user + 0.5% kernel
E/ActivityManager( 220): 0.5)6/com.android.systemui: 0.5% user + 0% kernel
E/ActivityManager( 220): 100%TOTAL: 4.8% user + 7.6% kernel + 87% iowait
E/ActivityManager( 220):CPUusage from 3697ms to 4223ms later:-- ANR后CPU的使用量
E/ActivityManager( 220): 25!404/com.android.email: 25% user + 0% kernel / faults: 191 minor
E/ActivityManager( 220): 16% 21603/__eas(par.hakan: 16% user + 0% kernel
E/ActivityManager( 220): 7.2% 21406/GC: 7.2% user + 0% kernel
E/ActivityManager( 220): 1.8% 21409/Compiler: 1.8% user + 0% kernel
E/ActivityManager( 220): 5.5"0/system_server: 0% user + 5.5% kernel / faults: 1 minor
E/ActivityManager( 220): 5.5% 263/InputDispatcher: 0% user + 5.5% kernel
E/ActivityManager( 220): 32%TOTAL: 28% user + 3.7% kernel




LOG可以看出ANR的类型,CPU的使用情况,如果CPU使用量接近100%,说明当前设备很忙,有可能是CPU饥饿导致了ANR

如果CPU使用量很少,说明主线程被BLOCK

如果IOwait很高,说明ANR有可能是主线程在进行I/O操作造成的

除了看LOG,解决ANR还得需要trace.txt文件,


trace文件分析



pid 21404 at 2011-04-0113:12:14 -----
Cmdline: com.android.email
DALVIK THREADS:
(mutexes: tll=0tsl=0 tscl=0 ghl=0 hwl=0 hwll=0)
"main" prio=5 tid=1NATIVE --表示线程状态 (
/​dalvik/​vm/​Thread.h
0042 enum ThreadStatus {
0043 THREAD_UNDEFINED = -1, /* makes enum compatible with int32_t */
0045 /* these match up with JDWP values */
0046 THREAD_ZOMBIE = 0, /* TERMINATED */
0047 THREAD_RUNNING = 1, /* RUNNABLE or running now */
0048 THREAD_TIMED_WAIT = 2, /* TIMED_WAITING in Object.wait() */
0049 THREAD_MONITOR = 3, /* BLOCKED on a monitor */
0050 THREAD_WAIT = 4, /* WAITING in Object.wait() */
0051 /* non-JDWP states */
0052 THREAD_INITIALIZING = 5, /* allocated, not yet running */
0053 THREAD_STARTING = 6, /* started, not yet on thread list */
0054 THREAD_NATIVE = 7, /* off in a JNI native method */
0055 THREAD_VMWAIT = 8, /* waiting on a VM resource */
0056 THREAD_SUSPENDED = 9, /* suspended, usually by GC or debugger */
0057 };
)
| group="main" sCount=1 dsCount=0obj=0x2aad2248 self=0xcf70
| sysTid=21404 nice=0 sched=0/0cgrp=[fopen-error:2] handle=1876218976
atandroid.os.MessageQueue.nativePollOnce(Native Method)
atandroid.os.MessageQueue.next(MessageQueue.java:119)
atandroid.os.Looper.loop(Looper.java:110)
at android.app.ActivityThread.main(ActivityThread.java:3688)
at java.lang.reflect.Method.invokeNative(Native Method)
atjava.lang.reflect.Method.invoke(Method.java:507)
atcom.android.internal.os.ZygoteInit$MethodAndArgsCaller.run(ZygoteInit.java:866)
at com.android.internal.os.ZygoteInit.main(ZygoteInit.java:624)
at dalvik.system.NativeStart.main(Native Method)



说明主线程在等待下条消息进入消息队列

我们可以很清楚的 解析 trace文件中 thread信息的含义了:

1. 第一行是 固定的头指明下面的都是 当前运行的 dvm thread “DALVIK THREADS:”

2. 第二行输出的是该 进程里各种线程互斥量的值。(具体的互斥量的作用在 dalvik 线程一章 单独陈述)

3. 第三行输出分别是 线程的名字(“main”),线程优先级(“prio=5”),线程id“tid=1”) 以及线程的 类型(“NATIVE”



案例

案例1:关键词:ContentResolver in AsyncTask onPostExecute, high iowait


Process:com.android.email
Activity:com.android.email/.activity.MessageView
Subject:keyDispatchingTimedOut
CPU usage from 2550ms to -2814ms ago:
57/system_server: 3.5% user + 1.4% kernel / faults: 86 minor 20major
4.4% 1134/com.android.email: 0.7% user + 3.7% kernel /faults: 38 minor 19 major
4% 372/com.android.eventstream: 0.7%user + 3.3% kernel / faults: 6 minor
1.1% 272/com.android.phone:0.9% user + 0.1% kernel / faults: 33 minor
0.9%2/com.android.systemui: 0.9% user + 0% kernel
0@9/com.android.eventstream.telephonyplugin: 0% user + 0% kernel /faults: 2 minor
0.1% 632/com.android.devicemonitor: 0.1% user + 0%kernel
100%TOTAL: 6.9% user + 8.2% kernel +84%iowait
 
pid 1134 at 2010-12-17 17:46:51 -----
Cmd line:com.android.email
DALVIK THREADS:
(mutexes: tll=0 tsl=0tscl=0 ghl=0 hwl=0 hwll=0)
"main" prio=5 tid=1 WAIT
|group="main" sCount=1 dsCount=0 obj=0x2aaca180self=0xcf20
| sysTid=1134 nice=0 sched=0/0 cgrp=[fopen-error:2]handle=1876218976
at java.lang.Object.wait(Native Method)
-waiting on <0x2aaca218> (a java.lang.VMThread)
atjava.lang.Thread.parkFor(Thread.java:1424)
atjava.lang.LangAccessImpl.parkFor(LangAccessImpl.java:48)
atsun.misc.Unsafe.park(Unsafe.java:337)
atjava.util.concurrent.locks.LockSupport.park(LockSupport.java:157)
atjava.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:808)
atjava.util.concurrent.locks.AbstractQueuedSynchronizer.acquireQueued(AbstractQueuedSynchronizer.java:841)
atjava.util.concurrent.locks.AbstractQueuedSynchronizer.acquire(AbstractQueuedSynchronizer.java:1171)
atjava.util.concurrent.locks.ReentrantLock$FairSync.lock(ReentrantLock.java:200)
atjava.util.concurrent.locks.ReentrantLock.lock(ReentrantLock.java:261)
atandroid.database.sqlite.SQLiteDatabase.lock(SQLiteDatabase.java:378)
atandroid.database.sqlite.SQLiteCursor.<init>(SQLiteCursor.java:222)
atandroid.database.sqlite.SQLiteDirectCursorDriver.query(SQLiteDirectCursorDriver.java:53)
atandroid.database.sqlite.SQLiteDatabase.rawQueryWithFactory(SQLiteDatabase.java:1356)
atandroid.database.sqlite.SQLiteDatabase.queryWithFactory(SQLiteDatabase.java:1235)
atandroid.database.sqlite.SQLiteDatabase.query(SQLiteDatabase.java:1189)
atandroid.database.sqlite.SQLiteDatabase.query(SQLiteDatabase.java:1271)
atcom.android.email.provider.EmailProvider.query(EmailProvider.java:1098)
atandroid.content.ContentProvider$Transport.query(ContentProvider.java:187)
atandroid.content.ContentResolver.query(ContentResolver.java:268)
atcom.android.email.provider.EmailContent$Message.restoreMessageWithId(EmailContent.java:648)
atcom.android.email.Controller.setMessageRead(Controller.java:658)
atcom.android.email.activity.MessageView.onMarkAsRead(MessageView.java:700)
atcom.android.email.activity.MessageView.access$2500(MessageView.java:98)
atcom.android.email.activity.MessageView$LoadBodyTask.onPostExecute(MessageView.java:1290)
atcom.android.email.activity.MessageView$LoadBodyTask.onPostExecute(MessageView.java:1255)
atandroid.os.AsyncTask.finish(AsyncTask.java:417)
atandroid.os.AsyncTask.access$300(AsyncTask.java:127)
atandroid.os.AsyncTask$InternalHandler.handleMessage(AsyncTask.java:429)
atandroid.os.Handler.dispatchMessage(Handler.java:99)
atandroid.os.Looper.loop(Looper.java:123)
atandroid.app.ActivityThread.main(ActivityThread.java:3652)
atjava.lang.reflect.Method.invokeNative(Native Method)
atjava.lang.reflect.Method.invoke(Method.java:507)
atcom.android.internal.os.ZygoteIn



原因:IOWait很高,说明当前系统在忙于I/O,因此数据库操作被阻塞




案例2:关键词:UI线程进行网络数据的读写


ANRin process: com.android.mediascape:PhotoViewer (last incom.android.mediascape:PhotoViewer)
Annotation:keyDispatchingTimedOut
CPU usage:
Load: 6.74 / 6.89 / 6.12
CPUusage from 8254ms to 3224ms ago:
ovider.webmedia: 4% = 4% user +0% kernel / faults: 68 minor
system_server: 2% = 1% user + 0%kernel / faults: 18 minor
re-initialized>: 0% = 0% user + 0%kernel / faults: 50 minor
events/0: 0% = 0% user + 0%kernel
TOTAL:7% = 6% user + 1% kernel
DALVIKTHREADS:
""main"" prio=5 tid=3 NATIVE
|group=""main"" sCount=1 dsCount=0 s=Yobj=0x4001b240 self=0xbda8
| sysTid=2579 nice=0 sched=0/0cgrp=unknown handle=-1343993184
atorg.apache.harmony.luni.platform.OSNetworkSystem.receiveStreamImpl(NativeMethod)
atorg.apache.harmony.luni.platform.OSNetworkSystem.receiveStream(OSNetworkSystem.java:478)
atorg.apache.harmony.luni.net.PlainSocketImpl.read(PlainSocketImpl.java:565)
atorg.apache.harmony.luni.net.SocketInputStream.read(SocketInputStream.java:87)
atorg.apache.harmony.luni.internal.net.www.protocol.http.HttpURLConnection$LimitedInputStream.read(HttpURLConnection.java:303)
atjava.io.InputStream.read(InputStream.java:133)
atjava.io.BufferedInputStream.fillbuf(BufferedInputStream.java:157)
atjava.io.BufferedInputStream.read(BufferedInputStream.java:346)
atandroid.graphics.BitmapFactory.nativeDecodeStream(Native Method)
atandroid.graphics.BitmapFactory.decodeStream(BitmapFactory.java:459)
atcom.android.mediascape.activity.PhotoViewerActivity.getPreviewImage(PhotoViewerActivity.java:4465)
atcom.android.mediascape.activity.PhotoViewerActivity.dispPreview(PhotoViewerActivity.java:4406)
atcom.android.mediascape.activity.PhotoViewerActivity.access$6500(PhotoViewerActivity.java:125)
atcom.android.mediascape.activity.PhotoViewerActivity$33$1.run(PhotoViewerActivity.java:4558)
atandroid.os.Handler.handleCallback(Handler.java:587)
atandroid.os.Handler.dispatchMessage(Handler.java:92)
atandroid.os.Looper.loop(Looper.java:123)
atandroid.app.ActivityThread.main(ActivityThread.java:4370)
atjava.lang.reflect.Method.invokeNative(Native Method)
atjava.lang.reflect.Method.invoke(Method.java:521)
atcom.android.internal.os.ZygoteInit$MethodAndArgsCaller.run(ZygoteInit.java:868)
atcom.android.internal.os.ZygoteInit.main(ZygoteInit.java:626)
atdalvik.system.NativeStart.main(Native Method)



关于网络连接,在设计的时候可以设置个timeout的时间或者放入独立的线程来处理。

关于Handler的问题,可以参考: http://developer.android.com/reference/android/os/Handler.html





案例3Memoryleak/Thread leak


I/ActivityManager( 1190): ANR in process:android.process.acore (last in android.process.acore)
I/ActivityManager( 1190): Annotation:keyDispatchingTimedOut
I/ActivityManager(1190): CPU usage:
I/ActivityManager( 1190):Load: 11.5 / 11.1 / 11.09
I/ActivityManager(1190): CPU usage from 9046ms to 4018ms ago:
I/ActivityManager( 1190): d.process.acore:98% = 97% user + 0% kernel / faults: 1134 minor
I/ActivityManager( 1190): system_server: 0% = 0% user + 0% kernel /faults: 1 minor
I/ActivityManager( 1190): adbd:0% = 0% user + 0% kernel
I/ActivityManager(1190): logcat: 0% = 0% user + 0% kernel
I/ActivityManager( 1190): TOTAL:100% = 98% user + 1% kernel
Cmdline: android.process.acore
DALVIK THREADS:
"main"prio=5 tid=3 VMWAIT
|group="main" sCount=1 dsCount=0 s=N obj=0x40026240self=0xbda8
| sysTid=1815 nice=0 sched=0/0 cgrp=unknownhandle=-1344001376
atdalvik.system.VMRuntime.trackExternalAllocation(NativeMethod)
atandroid.graphics.Bitmap.nativeCreate(Native Method)
atandroid.graphics.Bitmap.createBitmap(Bitmap.java:468)
atandroid.view.View.buildDrawingCache(View.java:6324)
atandroid.view.View.getDrawingCache(View.java:6178)
atandroid.view.ViewGroup.drawChild(ViewGroup.java:1541)
……
atcom.android.internal.policy.impl.PhoneWindow$DecorView.draw(PhoneWindow.java:1830)
atandroid.view.ViewRoot.draw(ViewRoot.java:1349)
atandroid.view.ViewRoot.performTraversals(ViewRoot.java:1114)
atandroid.view.ViewRoot.handleMessage(ViewRoot.java:1633)
atandroid.os.Handler.dispatchMessage(Handler.java:99)
atandroid.os.Looper.loop(Looper.java:123)
atandroid.app.ActivityThread.main(ActivityThread.java:4370)
atjava.lang.reflect.Method.invokeNative(Native Method)
atjava.lang.reflect.Method.invoke(Method.java:521)
atcom.android.internal.os.ZygoteInit$MethodAndArgsCaller.run(ZygoteInit.java:868)
atcom.android.internal.os.ZygoteInit.main(ZygoteInit.java:626)
atdalvik.system.NativeStart.main(Native Method)
"Thread-408"prio=5 tid=329 WAIT
|group="main" sCount=1 dsCount=0 s=N obj=0x46910d40self=0xcd0548
| sysTid=10602 nice=0 sched=0/0 cgrp=unknownhandle=15470792
at java.lang.Object.wait(Native Method)
-waiting on <0x468cd420> (a java.lang.Object)
atjava.lang.Object.wait(Object.java:288)
atcom.android.dialer.CallLogContentHelper$UiUpdaterExecutor$1.run(CallLogContentHelper.java:289)
atjava.lang.Thread.run(Thread.java:1096)


分析:
atdalvik.system.VMRuntime.trackExternalAllocation(NativeMethod)内存不足导致block在创建bitmap上
*MEMINFO in pid 1360 [android.process.acore] *
native dalvik other total


size: 17036 23111 N/A 40147
allocated: 16484 20675 N/A 37159
free: 296 2436 N/A 2732


解决:如果机器的内存族,可以修改虚拟机的内存为36M或更大,不过最好是复查代码,查看哪些内存没有释放

 



/**
 * @author zhoushengtao(周圣韬)
 * @since 2014年12月1日 下午6:10:39
 * @weixin stchou_zst Android高手进阶——Android ANR解决案例(内部资料)_第1张图片

 * @blog http://blog.csdn.net/yzzst

 * @交流学习QQ群:341989536

 * @私人QQ:445914891
 */




你可能感兴趣的:(Android高手进阶——Android ANR解决案例(内部资料))