Analyzing HTML Logs


#1

Hi there,

Thanks to google and the mirror of this site, I am now at the place where I’ve found my HTML logs and have downloaded it (it’s big) and I understand that in some way, one can read these logs to determine which pages are taking too long to load reading scripts and what not?

Does anyone have pointers on what I should be looking at?

Because honestly?

It’s meaningless and I need help bad.

Here is an example.
http://www.lalalalalal.com/forum/viewtopic.php?t=19&postdays=0&postorder=asc&start=280” "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)"
70.55.218.20 - - [23/Mar/2007:13:30:40 -0700] “GET /forum/templates/lalalala/images/lace.jpg HTTP/1.1” 304 256

help me. I will be your best friend forever.


#2

Have a read of the Apache Log File Docs to learn how to read the raw log files.

Here is my translation of the log snippet you posted. Note you have the order around the wrong way, the IP address is the first line of a log entry, so you are actually looking at both the previous request, and the start of the next request.

This is the IP address of the client (remote host) which made the request to the server.

The time that the request was received. The “-0700” is the time zone that the apache server is set too.

This is the url requested by the client, GET is the method of the request, the next part is the requested resource, the HTTP/1.1 is the protocol used by the client performing the request.

This is the status code that the server sends back to the client. See Hypertext Transfer Protocol – HTTP/1.1 for more about status codes.

This is the size of the object sent to the client.

This is the referring url for this request, this would be the page that the visitor was reading, it looks to me that this page requests “lace.jpg” as part of its template.

The User-Agent HTTP request header. It shows your visitor is using Internet Explorer 7, on Windows XP.

Hope that helps you out.


Hot Product Directory | Get Around The Net Directory