Dear all,
I'm trying to import into piwik our Apache logs but I have for all the lines of my log file a "invalid log lines".
Here is a line example of the log file:
My log format is :
And finaly here is my command line:
Does someone knows what is wrong in my regex ?
Many thanks
Marc AT
I'm trying to import into piwik our Apache logs but I have for all the lines of my log file a "invalid log lines".
Here is a line example of the log file:
www.ombudsman.europa.eu 136.173.180.12 136.173.180.12 [21/Nov/2014:13:32:35 +0100] 200 62218 2085387 "GET /lv/atyourservice/team.faces HTTP/1.1" "http://www.ombudsman.europa.eu/lt/atyourservice/team.faces" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:32.0) Gecko/20100101 Firefox/32.0"
My log format is :
LogFormat "%V\t%h\t%h\t%t\t%>s\t%b\t%D\t\"%r\"\t\"%{Referer}i\"\t\"%{User-Agent}i\""
And finaly here is my command line:
python /pathToPiwik/misc/log-analytics/import_logs.py \ -d \ --url=http://localhost/piwik/ \ --dry-run \ --show-progress \ --add-sites-new-hosts \ --idsite=1 \ --enable-static \ --enable-bots \ --enable-http-errors \ --enable-http-redirects \ --enable-reverse-dns \ --replay-tracking \ --log-format-regex='(?P<host>\S+) (?P<ip>\S+) \S+ [(?P<date>.*?) (?P<timezone>.*?)\] (?P<status>\S+) (?P<length>\S+) (?P<session_time>\S+) "\S+ (?P<path>.*?) \S+" "(?P<referrer>.*?)" "(?P<user_agent>.*?)"' \ /pathToLogs/access.log
Does someone knows what is wrong in my regex ?
Many thanks
Marc AT