Kaltura CE 11.16 on Ubuntu 14.04 LTS: Analytics Corrections

Hi all,

For DWH / Pentaho on Ubuntu I had to make a few corrections to the Apache log behaviour, as described here:
http://www.panda-os.com/tag/kaltura-dwh/#.V3RIm1e5nMU
After that I found that the path in the Kitchen scripts (hourly, daily, etc.) was not set correctly, so I corrected it to:

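#!/bin/bash
# Load the Kaltura environment settings (BASE_DIR is used below)
. /etc/kaltura.d/system.ini

# Corrected locations of the Pentaho Kitchen launcher and the DWH root
KITCHEN=$BASE_DIR/pentaho/pdi/kitchen.sh
ROOT_DIR=$BASE_DIR/dwh
# Date and hour stamp for this run
WHEN=$(date +%Y%m%d-%H)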
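
Before rerunning the ETL jobs, it may be worth confirming that the corrected paths actually resolve. The following is only a minimal sketch: the function name check_dwh_paths is hypothetical, and it assumes that BASE_DIR comes from /etc/kaltura.d/system.ini as in the script above.

```shell
#!/bin/bash
# Hypothetical helper (not part of Kaltura): verify that the paths used by
# the Kitchen wrapper exist.
# Argument: the Kaltura base directory (for example the value of $BASE_DIR).
check_dwh_paths() {
    local base_dir=$1
    local kitchen=$base_dir/pentaho/pdi/kitchen.sh
    local root_dir=$base_dir/dwh
    local status=0

    if [ ! -x "$kitchen" ]; then
        echo "missing or not executable: $kitchen"
        status=1
    fi
    if [ ! -d "$root_dir" ]; then
        echo "missing directory: $root_dir"
        status=1
    fi
    if [ "$status" -eq 0 ]; then
        echo "paths look fine under $base_dir"
    fi
    return "$status"
}

# Example call (assumes system.ini defines BASE_DIR):
#   . /etc/kaltura.d/system.ini && check_dwh_paths "$BASE_DIR"
```

The function only reports what it finds and changes nothing on disk.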

Now I am getting correct log output for Statistics / Analytics. However, when I check the DB table dwh_fact_events after making some plays (which are recorded correctly in kaltura_apache_access.log and events.log under /opt/kaltura/logs), the table is still empty and there are no errors in the error log…

Any advice?

root@video:~# grep ERROR /opt/kaltura/dwh/logs/*
/opt/kaltura/dwh/logs/etl_daily-20160630.log:ERROR 30-06 14:43:42,738 - perform aggregations - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_daily-20160630.log:ERROR 30-06 14:43:42,740 - Run kjbVar job - An error occurred executing this job entry :
/opt/kaltura/dwh/logs/etl_daily-20160630.log:ERROR 30-06 14:43:42,741 - Run kjbVar job - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_daily-20160630.log:ERROR 30-06 14:43:42,771 - Abort job 1 - Aborting job.
/opt/kaltura/dwh/logs/etl_hourly-20160630-14.log:ERROR 30-06 14:43:22,613 - process_events - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_hourly-20160630-14.log:ERROR 30-06 14:43:22,616 - file_processing_job - An error occurred executing this job entry :
/opt/kaltura/dwh/logs/etl_hourly-20160630-14.log:ERROR 30-06 14:43:22,616 - file_processing_job - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_perform_retention_policy-20160630.log:ERROR 30-06 14:43:55,572 - Move old partitions to archive - An error occurred executing this job entry :
/opt/kaltura/dwh/logs/etl_perform_retention_policy-20160630.log:ERROR 30-06 14:43:55,599 - Kitchen - Finished with errors
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,736 - Mapping (sub-transformation) - Unable to load the mapping transformation because of an error : org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,738 - Mapping (sub-transformation) - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,738 - Mapping (sub-transformation) - Error initializing step [Mapping (sub-transformation)]
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,754 - Mapping (sub-transformation) - Unable to load the mapping transformation because of an error : org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,756 - Mapping (sub-transformation) - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,756 - Mapping (sub-transformation) - Error initializing step [Mapping (sub-transformation)]
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,756 - iterate_sequences - Step [Mapping (sub-transformation).0] failed to initialize!
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,757 - iterate_sequences - Step [Mapping (sub-transformation).1] failed to initialize!
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,758 - Iterate sequences - Unable to prepare for execution of the transformation
/opt/kaltura/dwh/logs/etl_update_dims-20160630-14.log:ERROR 30-06 14:43:32,759 - Iterate sequences - org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : org.pentaho.di.core.exception.KettleException:
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : Unexpected error during transformation metadata load
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : Error reading object from XML file
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : Unable to load step info from XML step nodeorg.pentaho.di.core.exception.KettleStepLoaderException:
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : Unable to load class for step/plugin with id [MapperFieldRunner]. Check if the plugin is available in the plugins subdirectory of the Kettle distribution.
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : Unable to load class for step/plugin with id [MapperFieldRunner]. Check if the plugin is available in the plugins subdirectory of the Kettle distribution.
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) :
/opt/kaltura/dwh/logs/log_aggregation_perform_aggregations.log:2016/06/30 14:43:42 - perform aggregations - ERROR (version 4.2.1-stable, build 15952 from 2011-10-25 15.27.10 by buildguy) : at org.pentaho.di.job.entries.trans.JobEntryTrans.getTransMeta(JobEntryTrans.java:1195)
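
A long grep dump like the one above can be condensed to the distinct plugin ids that Kettle reports it could not load. This is a rough sketch only; the function name missing_plugins is hypothetical, and it relies solely on the "plugin with id [...]" wording visible in the log lines above. In that excerpt the only id mentioned is MapperFieldRunner.

```shell
#!/bin/bash
# Hypothetical helper: print each distinct step/plugin id that appears in
# "plugin with id [...]" messages.
# Arguments: one or more log files.
missing_plugins() {
    grep -oh 'plugin with id \[[^]]*]' "$@" \
        | sed 's/.*\[\(.*\)]/\1/' \
        | sort -u
}

# Example call:
#   missing_plugins /opt/kaltura/dwh/logs/*.log
```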

Hello,

It seems that either some plugins are missing under /opt/kaltura/pentaho/pdi/plugins or there is a permission issue.
Try running /opt/kaltura/bin/kaltura-run-dwh.sh as root and look at the logs. Does that work?
Also, make sure you have no leftover lock files, see:

Hello,
At the moment we don’t have any lock files left, and the script runs smoothly, but it still doesn’t create any DB entries.

Hi,

Did you look at https://github.com/kaltura/platform-install-packages/blob/Kajam-11.18.0/doc/kaltura-packages-faq.md#analytics-issues? Did you run these queries? Please post what each one returns.
For example:

mysql> select * from kalturadw_ds.files where insert_time >=%Y%m%d;

This should return the archive paths of the Apache access logs that DWH went through. Does it?
And do these access files contain the action=collect lines?

Note that plays made within the KMC will never be counted in Analytics.
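
The %Y%m%d in the query above is a date-format placeholder and needs to be replaced with a concrete day. A small sketch of how that could be done from the shell follows; the function name files_query is hypothetical, and the mysql user in the example call is left as a placeholder.

```shell
#!/bin/bash
# Hypothetical helper: build the kalturadw_ds.files query for a given day.
# Argument: day as YYYYMMDD (defaults to today's date).
files_query() {
    local day=${1:-$(date +%Y%m%d)}
    echo "select * from kalturadw_ds.files where insert_time >= ${day};"
}

# Example call (replace <user> with your own DB user; -p prompts for the password):
#   mysql -u <user> -p -e "$(files_query 20160630)"
```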

Hi,

The result shows only the file name in the DB, not the complete path, but after processing the files end up in folders like:
/opt/kaltura/dwh/cycles/process/10/video-kaltura_apache_access.log-20160701-06

1 video-kaltura_apache_access.log-20160622-06 IN_CYCLE 2016-06-27 12:02:06 1018 1 1 gz
2 video-kaltura_apache_access.log-20160623-06 IN_CYCLE 2016-06-27 12:02:06 539 1 1 gz
3 video-kaltura_apache_access.log-20160624-06 IN_CYCLE 2016-06-27 12:02:06 457 1 1 gz
4 video-kaltura_apache_access.log-20160625-06 IN_CYCLE 2016-06-27 12:02:06 532 1 1 gz
5 video-kaltura_apache_access.log-20160626-06 IN_CYCLE 2016-06-27 12:02:06 870 1 1 gz
6 video-kaltura_apache_access.log-20160627-06 IN_CYCLE 2016-06-27 12:02:06 283 1 1 gz
7 video-kaltura_apache_access.log-20160627-12 IN_CYCLE 2016-06-27 12:02:06 12 1 1 gz
8 video-kaltura_apache_access.log-20160627-21 IN_CYCLE 2016-06-27 21:49:13 13 1 2 gz
9 video-kaltura_apache_access.log-20160627-22 IN_CYCLE 2016-06-27 22:05:57 14 1 3 gz
10 video-kaltura_apache_access.log-20160628-06 IN_CYCLE 2016-06-30 13:46:54 248 1 4 gz
11 video-kaltura_apache_access.log-20160629-06 IN_CYCLE 2016-06-30 13:46:54 275 1 4 gz
12 video-kaltura_apache_access.log-20160630-06 IN_CYCLE 2016-06-30 13:46:54 643 1 4 gz
13 video-kaltura_apache_access.log-20160630-14 IN_CYCLE 2016-06-30 14:43:20 43 1 5 gz
14 video-kaltura_apache_access.log-20160630-16 IN_CYCLE 2016-06-30 16:14:18 92 1 6 gz
15 video-kaltura_apache_access.log-20160630-21 IN_CYCLE 2016-06-30 22:00:14 3 1 7 gz
16 video-kaltura_apache_access.log-20160630-22 IN_CYCLE 2016-06-30 22:02:43 141 1 8 gz
17 video-kaltura_apache_access.log-20160630-23 IN_CYCLE 2016-06-30 23:03:57 23 1 9 gz
18 video-kaltura_apache_access.log-20160701-06 IN_CYCLE 2016-07-01 11:06:56 171 1 10 gz
19 video-kaltura_apache_access.log-20160701-11 IN_CYCLE 2016-07-01 11:06:56 100 1 10 gz

and the files do include collect statements:

109.70.113.100 - - [30/Jun/2016:23:17:34 +0200] “GET /api_v3/index.php?service=stats&action=kmcCollect&kalsig=a420284a3a8c5cf4cb55cece6c4a7b55&ks=YjA1ZjdhYWQ4ODQ1YTMxYTc5YTUzNTA2OWFiMDZjZmFkNzc0MTRjZXwxMDE7MTAxOzE0Njc0MDQ4NjM7MjsxNDY3MzE4NDYzLjIwMzg7YWdAcXVhZGNvbS5jaDtkaXNhYmxlZW50aXRsZW1lbnQ7Ow%3D%3D&kmcEvent%3AeventTimestamp=1467321454%2E427&ignoreNull=1&kmcEvent%3AobjectType=KalturaStatsKmcEvent&kmcEvent%3ApartnerId=101&clientTag=kmc%3A5%2E40%2E2&kmcEvent%3AclientVer=1%2E0%3AKMC%5FANDROMEDA%2FReports%20and%20Analytics%3A4&kmcEvent%3AkmcEventType=1071&apiVersion=3%2E1%2E5&kmcEvent%3AkmcEventActionPath=Main%20Menu%3EContents%20and%20Reports%20Tab&partnerId=101 HTTP/1.1” 200 607 0/53411 “http://video.quadcom.ch/index.php/kmc/kmc4” “Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_4) AppleWebKit/601.5.17 (KHTML, like Gecko) Version/9.1 Safari/601.5.17” “-” 109.70.113.100 “-” “video.quadcom.ch” 25009 862617523 + 1251 “-” “-” “-” “-” “no-store, no-cache, must-revalidate, post-check=0, pre-check=0” 101
109.70.113.100 - - [30/Jun/2016:23:50:08 +0200] “GET /api_v3/index.php?service=stats&apiVersion=3.1&expiry=86400&clientTag=kwidget%3Av2.43&format=1&ignoreNull=1&action=collect&event:eventType=2&event:clientVer=2.43&event:currentPoint=0&event:duration=2790&event:eventTimestamp=1467323408865&event:isFirstInSession=false&event:objectType=KalturaStatsEvent&event:partnerId=101&event:sessionId=81b62913-e119-f82e-6cf9-323b72bf5285&event:uiconfId=23448214&event:seek=false&event:entryId=0_inp6xswf&event:widgetId=_101&event:referrer=http%253A%252F%252Fvideo.quadcom.ch%252Findex.php%252Fextwidget%252Fpreview%252Fpartner_id%252F101%252Fuiconf_id%252F23448214%252Fentry_id%252F0_inp6xswf%252Fembed%252Fdynamic%253F%2526flashvars%255BstreamerType%255D%253Dauto&kalsig=9580db4f35e414c99dad27fa3bb7e377 HTTP/1.1” 200 458 0/22779 “http://video.quadcom.ch/index.php/extwidget/preview/partner_id/101/uiconf_id/23448214/entry_id/0_inp6xswf/embed/dynamic?&flashvars[streamerType]=auto” “Mozilla/5.0 (iPhone; CPU iPhone OS 9_3_2 like Mac OS X) AppleWebKit/601.1.46 (KHTML, like Gecko) Version/9.0 Mobile/13F69 Safari/601.1” “-” 109.70.113.100 “-” “video.quadcom.ch” 25457 1265801784 + 1191 “-” “-” “-” “-” “no-store, no-cache, must-revalidate, post-check=0, pre-check=0” -
109.70.113.100 - - [30/Jun/2016:23:50:09 +0200] “GET /api_v3/index.php?service=stats&apiVersion=3.1&expiry=86400&clientTag=kwidget%3Av2.43&format=1&ignoreNull=1&action=collect&event:eventType=1&event:clientVer=2.43&event:currentPoint=0&event:duration=2790&event:eventTimestamp=1467323409060&event:isFirstInSession=false&event:objectType=KalturaStatsEvent&event:partnerId=101&event:sessionId=81b62913-e119-f82e-6cf9-323b72bf5285&event:uiconfId=23448214&event:seek=false&event:entryId=0_inp6xswf&event:widgetId=_101&event:referrer=http%253A%252F%252Fvideo.quadcom.ch%252Findex.php%252Fextwidget%252Fpreview%252Fpartner_id%252F101%252Fuiconf_id%252F23448214%252Fentry_id%252F0_inp6xswf%252Fembed%252Fdynamic%253F%2526flashvars%255BstreamerType%255D%253Dauto&kalsig=6d02641c1713cc937a4401d4f773e92d HTTP/1.1” 200 457 0/20776 “http://video.quadcom.ch/index.php/extwidget/preview/partner_id/101/uiconf_id/23448214/entry_id/0_inp6xswf/embed/dynamic?&flashvars[streamerType]=auto” “Mozilla/5.0 (iPhone; CPU iPhone OS 9_3_2 like Mac OS X) AppleWebKit/601.1.46 (KHTML, like Gecko) Version/9.0 Mobile/13F69 Safari/601.1” “-” 109.70.113.100 “-” “video.quadcom.ch” 25011 993648063 + 1191 “-” “-” “-” “-” “no-store, no-cache, must-revalidate, post-check=0, pre-check=0” -
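
Of the three lines quoted above, the first is a kmcCollect request and the other two are collect requests. To see how many of each an access log contains, something like the following could be used. It is only a sketch; the function name count_collect_lines is hypothetical, and it assumes the log is either plain text or gzip-compressed.

```shell
#!/bin/bash
# Hypothetical helper: count "action=collect" and "action=kmcCollect"
# requests in one access log.
# gzip -cdf decompresses gzip files and passes plain files through unchanged.
# Argument: path to the access log.
count_collect_lines() {
    local log=$1
    local plays kmc
    # grep -c exits non-zero when the count is 0, hence the "|| true".
    plays=$(gzip -cdf -- "$log" | grep -c 'action=collect[& ]' || true)
    kmc=$(gzip -cdf -- "$log" | grep -c 'action=kmcCollect[& ]' || true)
    echo "collect=$plays kmcCollect=$kmc"
}

# Example call, using the path format shown earlier in this post:
#   count_collect_lines /opt/kaltura/dwh/cycles/process/10/video-kaltura_apache_access.log-20160701-06
```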

The entries are also there, but the last sync dates are:

2 0 dim_sync_last_update -1 2016-06-20 01:03:41
3 2 fms_stale_session_days_limit 3 2016-06-21 01:03:41
8 0 referencial_integrity_last_update 0 2016-07-01 11:07:57
10 0 convert_job_fact_last_update 0 2016-06-20 01:03:41
4 0 sync_last_execution_plays_views 0 2016-06-21 01:03:41
5 0 sync_start_time_plays_views 0 2016-06-21 01:03:41
6 0 sync_last_execution_kuser_storage 0 2016-06-21 01:03:41
7 0 sync_start_time_kuser_storage 0 2016-06-21 01:03:41
9 9 transcoding_errors_last_update 0 2010-01-01 00:00:00

Edit: The directory TransferPath = ${CyclePath}/transfer/, as defined in kettle.properties, doesn’t exist. Should I create it?
Edit 2: Errors from the file etl_update_dims.log: https://share.quadcom.ch/share/proxy/alfresco-noauth/api/internal/shared/node/content/55_KVzoyRsG3iIy2O8WobA/ErrorLog.rtf?a=true
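
To see which directory the TransferPath setting actually points to, the ${CyclePath} reference can be expanded by hand. The following is a rough sketch under the assumption that kettle.properties uses simple "Key = value" lines as quoted above. The function names prop_value and resolve_transfer_path are hypothetical, and the path to kettle.properties in the example call is a placeholder. The sketch only reports the directory and does not create it.

```shell
#!/bin/bash
# Hypothetical helper: print the value of a key from a "Key = value" file.
# Arguments: file, key.
prop_value() {
    local file=$1 key=$2
    sed -n "s/^[[:space:]]*${key}[[:space:]]*=[[:space:]]*\(.*[^[:space:]]\)[[:space:]]*$/\1/p" "$file" \
        | head -n 1
}

# Hypothetical helper: print TransferPath with ${CyclePath} expanded.
# Argument: path to a kettle.properties-style file.
resolve_transfer_path() {
    local file=$1
    local cycle transfer resolved
    local ref='${CyclePath}'   # literal text to look for, not a shell variable
    cycle=$(prop_value "$file" CyclePath)
    transfer=$(prop_value "$file" TransferPath)
    resolved=${transfer//"$ref"/$cycle}
    echo "$resolved"
}

# Example call (placeholder path):
#   dir=$(resolve_transfer_path /path/to/kettle.properties)
#   if [ -d "$dir" ]; then echo "exists: $dir"; else echo "missing: $dir"; fi
```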