Backend app for visualizing CANedge log files in Grafana (directly from local disk or S3)

Overview

CANedge Grafana Backend - Visualize CAN/LIN Data in Dashboards

This project enables easy dashboard visualization of log files from the CANedge CAN/LIN data logger.

Specifically, a light-weight backend app loads, DBC decodes and parses MDF log files from local disk or an S3 server. This is done 'on demand' in response to query requests sent from a Grafana dashboard frontend by end users.

This project is currently in BETA - major changes will be made.

CAN Bus Grafana Dashboard

Backend vs. Writer

We provide two options for integrating your CANedge data with Grafana dashboards:

The CANedge Grafana Backend app only processes data 'when needed' by an end user - and requires no database. It is ideal when you have large amounts of data - as you only process the data you need to visualize.

The CANedge InfluxDB Writer processes data in advance (e.g. periodically or on-file-upload) and writes it to a database. It is ideal if dashboard loading speed is critical - but with the downside that data is processed & stored even if it is not used.

For details incl. 'pros & cons', see our intro to telematics dashboards.


Features

- allow users to visualize data from all of your devices & log files in Grafana 
- data is only processed "on request" - avoiding the need for costly databases
- data can be fetched from local disk or S3
- data can be visualized as soon as log files are uploaded to S3 for 'near real-time updates'
- the backend app can be easily deployed on e.g. your PC or AWS EC2 instance 
- plug & play dashboard templates & sample data let you get started quickly 
- view log file sessions & splits via Annotations, enabling easy identification of underlying data 
- allow end users control over what devices/signals are displayed via flexible Variables

Installation

In this section we detail how to deploy the app on a PC or an AWS EC2 instance.

Note: We recommend to test the local deployment with our sample data as the first step.


1: Deploy the integration locally on your PC

A local PC deployment is recommended if you wish to load data from an SD, local disk or MinIO S3.

Deploy the backend app locally

  • Install Python 3.7 for Windows (32 bit/64 bit) or Linux (enable 'Add to PATH')
  • Download this project as a zip via the green button and unzip it
  • Open the folder with the requirements.txt file and enter below in your command prompt:
Windows
python -m venv env & env\Scripts\activate & pip install -r requirements.txt
python canedge_datasource_cli.py "file:///%cd%/LOG" --port 8080
Linux
python3 -m venv env && source env/bin/activate && pip install -r requirements.txt
python3 canedge_datasource_cli.py file:///$PWD/LOG --port 8080

Set up Grafana locally

  • Install Grafana locally and enter http://localhost:3000 in your browser to open Grafana
  • In Configuration/Plugins install SimpleJson and TrackMap
  • In Configuration/DataSources select Add datasource and SimpleJson and set it as the 'default'
  • Enter the URL http://localhost:8080/, hit Save & test and verify that it works
  • In Dashboards/Browse click Import and load the dashboard-template-sample-data.json from this repo

You should now see the sample data visualized in Grafana.

Next: If you aim to work with CANedge2 data from AWS S3, go to step 2 - otherwise go to step 3.


2: Deploy the integration on AWS EC2 & Grafana Cloud

An AWS EC2 instance is recommended if you wish to load data from your AWS S3 bucket.

Deploy the backend app on AWS EC2

  • Login to AWS, search for EC2/Instances and click Launch instances
  • Select Ubuntu Server 20.04 LTS (HVM), SSD Volume Type, t3.small and proceed
  • In Step 6, click Add Rule/Custom TCP Rule and set Port Range to 8080
  • Launch the instance, then create & store your credentials (we will not use them for now)
  • Wait ~5 min, click on your instance and note your IP (the Public IPv4 address)
  • Click Connect/Connect to enter the GUI console, then enter the following:
sudo apt update && sudo apt install python3 python3-pip python3-venv tmux 
git clone https://github.com/CSS-Electronics/canedge-grafana-backend.git && cd canedge-grafana-backend
python3 -m venv env && source env/bin/activate && pip install -r requirements.txt
tmux
python3 canedge_datasource_cli.py file:///$PWD/LOG --port 8080

Set up Grafana Cloud

  • Set up a free Grafana Cloud account and log in
  • In Configuration/Plugins install SimpleJson and TrackMap (log out and in again)
  • In Configuration/DataSources select Add datasource and SimpleJson and set it as the 'default'
  • Replace your datasource URL with the http://[IP]:[port] endpoint and click Save & test
  • In Dashboards/Browse click Import and load the dashboard-template-sample-data.json from this repo

You should now see the sample data visualized in your imported dashboard. In the AWS EC2 console you can press ctrl + B then D to de-attach from the session, allowing it to run even when you close the GUI console.

Next: See step 3 on loading your AWS S3 data and step 5 on deploying the app as a service for production.


3: Load your own data & DBC files

Below we outline how to load your own data & DBC files.

Note: To activate your virtual environment use env\Scripts\activate (Linux: source env/bin/activate)

Load from local disk

  • Replace the sample LOG/ folder with your own LOG/ folder (or add an absolute path)
  • Verify that your data is structured as on the CANedge SD card i.e. [device_id]/[session]/[split].MF4
  • Add your DBC file(s) to the root of the folder
  • Verify that your venv is active and start the app

Load from S3

  • Add your DBC file(s) to the root of your S3 bucket
  • Verify that your venv is active and start the app with below syntax (use python3 on Linux/EC2)
python canedge_datasource_cli.py [endpoint] --port 8080 --s3_ak [access_key] --s3_sk [secret_key] --s3_bucket [bucket]
  • AWS S3 endpoint example: https://s3.eu-central-1.amazonaws.com
  • Google S3 endpoint example: https://storage.googleapis.com
  • MinIO S3 endpoint example: http://192.168.192.1:9000

Import simplified dashboard template

  • To get started, import the dashboard-template-simple.json to visualize your own data
  • After this, you can start customizing your panels as explained in step 4

Regarding DBC files

You can load as many DBC files as you want without reducing performance, as long as your queries only use one at a time (as is e.g. the case when using the simple dashboard template). However, if your queries need to use multiple DBC files, you may consider 'combining' your DBC files for optimal performance.

Regarding compression

It is recommended to enable the CANedge compression as the compressed MFC files are 50%+ smaller and thus faster to load.


4: Customize your Grafana dashboard

The dashboard-template-sample-data.json can be used to identify how to make queries, incl. below examples:

# create a fully customized query that depends on what the user selects in the dropdown 
{"device":"${DEVICE}","itf":"${ITF}","chn":"${CHN}","db":"${DB}","signal":"${SIGNAL}"}

# create a query for a panel that locks a signal, but keeps the device selectable
{"device":"${DEVICE}","itf":"CAN","chn":"CH2","db":"canmod-gps","signal":"Speed"}

# create a query for parsing multiple signals, e.g. for a TrackMap plot
{"device":"${DEVICE}","itf":"CAN","chn":"CH2","db":"canmod-gps","signal":"(Latitude|Longitude)"}

Bundle queries for multiple panels

When displaying multiple panels in your dashboard, it is critical to setup all queries in a single panel (as in our sample data template). All other panels can then be set up to refer to the original panel by setting the datasource as -- Dashboard --. For both the 'query panel' and 'referring panels' you can then use the Transform tab to Filter data by query. This allows you to specify which query should be displayed in which panel. The end result is that only 1 query is sent to the backend - which means that your CANedge log files are only processed once per update.

Set up Grafana Variables & Annotations

Grafana Variables allow users to dynamically control what is displayed in certain panels via dropdowns. For details on how the Variables are defined, see the template dashboard under Settings/Variables.

Similarly, Annotations can be used to display when a new log file 'session' or 'split' occurs, as well as display the log file name. This makes it easy to identify the log files underlying a specific view - and then finding these via CANcloud or TntDrive for further processing.

Regarding performance

Using the 'zoom out' button repeatedly will currently generate a queue of requests, each of which will be processed by the backend. Until this is optimized, we recommend to make a single request a time - e.g. by using the time period selector instead of the 'zoom out' button.

Also, loading speed increases when displaying long time periods (as the data for the period is processed in real-time).


5: Move to a production setup

Managing your EC2 tmux session

Below commands are useful in managing your tmux session while you're still testing your deployment.

  • tmux: Start a session
  • tmux ls: List sessions
  • tmux attach: Re-attach to session
  • tmux kill-session: Stop session

Deploy your app as an EC2 service for production

The above setup is suitable for development & testing. Once you're ready to deploy for production, you may prefer to set up a service. This ensures that your app automatically restarts after an instance reboot or a crash. To set it up as a service, follow the below steps:

  • Ensure you've followed the previous EC2 steps incl. the virtual environment
  • Update the ExecStart line in the canedge_grafana_backend.service 'unit file' with your S3 details
  • Upload the modified file to get a public URL
  • In your EC2 instance, use below commands to deploy the file
sudo wget -N [your_file_url]
sudo cp canedge_grafana_backend.service /etc/systemd/system/
sudo systemctl daemon-reload
sudo systemctl start canedge_grafana_backend
sudo systemctl enable canedge_grafana_backend
sudo journalctl -f -u canedge_grafana_backend

The service should now be deployed, which you can verify via the console output. If you need to make updates to your unit file, simply repeat the above. You can stop the service via sudo systemctl stop [service].

Regarding EC2 costs

You can find details on AWS EC2 pricing here. A t3.small instance typically costs ~0.02$/hour (~15-20$/month). We recommend that you monitor usage during your tests early on to ensure that no unexpected cost developments occur. Note also that you do not pay for the data transfer from S3 into EC2 if deployed within the same region.

Regarding public EC2 IP

Note that rebooting your EC2 instance will imply that your endpoint IP is changed - and thus you'll need to update your datasource. There are methods to set a fixed IP, though not in scope of this README.

Port forwarding a local deployment

If you want to access the data remotely, you can set up port forwarding. Below we outline how to port forward the backend app for use as a datasource in Grafana Cloud - but you could of course also directly port forward your local Grafana dashboard directly via port 3000.

  • Set up port forwarding on your WiFi router for port 8080
  • Run the app again (you may need to allow access via your firewall)
  • Find your public IP to get your endpoint as: http://[IP]:[port] (e.g. http://5.105.117.49:8080/)
  • In Grafana, add your new endpoint URL and click Save & test

Pending tasks

Below are a list of pending items:

  • Optimize Flask/Waitress session management for stability
  • Improve performance for multiple DBC files
  • Update code/guide for TLS-enabled deployment
  • Provide guidance on how to best scale the app for multiple front-end users
  • Determine if using Browser in SimpleJson datasource improves performance (requires TLS)
You might also like...
A command line tool for visualizing CSV/spreadsheet-like data
A command line tool for visualizing CSV/spreadsheet-like data

PerfPlotter Read data from CSV files using pandas and generate interactive plots using bokeh, which can then be embedded into HTML pages and served by

The Timescale NFT Starter Kit is a step-by-step guide to get up and running with collecting, storing, analyzing and visualizing NFT data from OpenSea, using PostgreSQL and TimescaleDB.

Timescale NFT Starter Kit The Timescale NFT Starter Kit is a step-by-step guide to get up and running with collecting, storing, analyzing and visualiz

Python toolkit for defining+simulating+visualizing+analyzing attractors, dynamical systems, iterated function systems, roulette curves, and more
Python toolkit for defining+simulating+visualizing+analyzing attractors, dynamical systems, iterated function systems, roulette curves, and more

Attractors A small module that provides functions and classes for very efficient simulation and rendering of iterated function systems; dynamical syst

A python-generated website for visualizing the novel coronavirus (COVID-19) data for Greece.
A python-generated website for visualizing the novel coronavirus (COVID-19) data for Greece.

COVID-19-Greece A python-generated website for visualizing the novel coronavirus (COVID-19) data for Greece. Data sources Data provided by Johns Hopki

Visualizing weather changes across the world using third party APIs and Python.
Visualizing weather changes across the world using third party APIs and Python.

WEATHER FORECASTING ACROSS THE WORLD Overview Python scripts were created to visualize the weather for over 500 cities across the world at varying di

Tools for calculating and visualizing Elo-like ratings of MLB teams using Retosheet data
Tools for calculating and visualizing Elo-like ratings of MLB teams using Retosheet data

Overview This project uses historical baseball games data to calculate an Elo-like rating for MLB teams based on regular season match ups. The Elo rat

Generate SVG (dark/light) images visualizing (private/public) GitHub repo statistics for profile/website.

Generate daily updated visualizations of GitHub user and repository statistics from the GitHub API using GitHub Actions for any combination of private and public repositories, whether owned or contributed to - no server required.

Pydrawer: The Python package for visualizing curves and linear transformations in a super simple way

pydrawer 📐 The Python package for visualizing curves and linear transformations in a super simple way. ✏️ Installation Install pydrawer package with

Curvipy - The Python package for visualizing curves and linear transformations in a super simple way

Curvipy - The Python package for visualizing curves and linear transformations in a super simple way

Comments
  • Issue with deploying the app locally (1: Deploy the integration locally on your PC)

    Issue with deploying the app locally (1: Deploy the integration locally on your PC)

    Commit : 196aa9efc4ca892554eabff52175e6b2523b3e6 (tag : v1.0.0)

    Platform : Windows 10

    I have followed these steps : https://github.com/CSS-Electronics/canedge-grafana-backend#1-deploy-the-integration-locally-on-your-pc

    The application starts up fine and is validated by SimpleJson; As soon as I attempt to import the dashboard dashboard-template-sample-data.json, the following error appears :

    (env) C:\{SNIPPED/PATH/}canedge-grafana-backend>python canedge_datasource_cli.py "file:///%cd%/LOG" --port 8080 --limit 100
    Mount path: file:///C:\{SNIPPED/PATH/}canedge-grafana-backend/LOG
    Loaded DBs: canmod-gps
    2022-04-27 08:42:36,438 - waitress - INFO - Serving on http://0.0.0.0:8080
    2022-04-27 08:54:08,622 - canedge_datasource.annotations - WARNING - Failed to annotate: int() argument must be a string, a bytes-like object or a number, not 'dict'
    2022-04-27 08:54:08,628 - canedge_datasource.annotations - WARNING - Failed to annotate: int() argument must be a string, a bytes-like object or a number, not 'dict'
    2022-04-27 08:54:08,854 - canedge_datasource - ERROR - Exception on /query [POST]
    Traceback (most recent call last):
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\flask\app.py", line 2073, in wsgi_app
        response = self.full_dispatch_request()
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\flask\app.py", line 1518, in full_dispatch_request
        rv = self.handle_user_exception(e)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\flask\app.py", line 1516, in full_dispatch_request
        rv = self.dispatch_request()
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\flask\app.py", line 1502, in dispatch_request
        return self.ensure_sync(self.view_functions[rule.endpoint])(**req.view_args)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\canedge_datasource\query.py", line 130, in query_view
        return jsonify(query_cache(req_in))
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\flask_caching\__init__.py", line 952, in decorated_function
        rv = f(*args, **kwargs)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\canedge_datasource\query.py", line 117, in query_cache
        res = _query_time_series(req, start_date, stop_date)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\canedge_datasource\query.py", line 171, in _query_time_series
        return time_series_phy_data(fs=app.fs,
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\canedge_datasource\signal.py", line 202, in time_series_phy_data
        log_files = canedge_browser.get_log_files(fs, device, start_date=start_date, stop_date=stop_date,
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\canedge_browser\listing.py", line 152, in get_log_files
        selected_sessions = _bisect_list(
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\canedge_browser\listing.py", line 354, in _bisect_list
        start_index = bisect.bisect_left(bisect_list, lower_bound)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\canedge_browser\support\FuncBackedList.py", line 39, in __getitem__
        self._values[item] = self._func(key)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\canedge_browser\support\FuncBackedList.py", line 15, in <lambda>
        self._func = lambda x: func(x, *args, **kwargs)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\canedge_browser\listing.py", line 325, in _extract_date_from_session_wrapper
        result = extract_date(handle, passwords)
      File "C:\{SNIPPED/PATH/}canedge-grafana-backend\env\lib\site-packages\canedge_browser\listing.py", line 239, in _extract_date_mdf4
        mdf_file = mdf_iter.MdfFile(*args)
    TypeError: int() argument must be a string, a bytes-like object or a number, not 'dict'
    
    opened by fstojanovic 3
Releases(v1.1.2)
A minimalistic wrapper around PyOpenGL to save development time

glpy glpy is pyOpenGl wrapper which lets you work with pyOpenGl easily.It is not meant to be a replacement for pyOpenGl but runs on top of pyOpenGl to

Abhinav 9 Apr 02, 2022
Learn Basic to advanced level Data visualisation techniques from this Repository

Data visualisation Hey, You can learn Basic to advanced level Data visualisation techniques from this Repository. Data visualization is the graphic re

Shashank dwivedi 16 Jan 03, 2023
Draw interactive NetworkX graphs with Altair

nx_altair Draw NetworkX graphs with Altair nx_altair offers a similar draw API to NetworkX but returns Altair Charts instead. If you'd like to contrib

Zachary Sailer 206 Dec 12, 2022
University of Missouri - Kansas City: CS451R: Capstone

CS451RC University of Missouri - Kansas City: CS451R: Capstone Installation cd git clone https://github.com/ala2q6/CS451RC.git cd CS451RC pip3 instal

Alex Arbuckle 1 Nov 17, 2021
Process dataframe in a easily way.

Popanda Written by Shengxuan Wang at OSU. Used for processing dataframe, especially for machine learning. The name is from "Po" in the movie Kung Fu P

ShawnWang 1 Dec 24, 2021
Multi-class confusion matrix library in Python

Table of contents Overview Installation Usage Document Try PyCM in Your Browser Issues & Bug Reports Todo Outputs Dependencies Contribution References

Sepand Haghighi 1.3k Dec 31, 2022
Plot and save the ground truth and predicted results of human 3.6 M and CMU mocap dataset.

Visualization-of-Human3.6M-Dataset Plot and save the ground truth and predicted results of human 3.6 M and CMU mocap dataset. human-motion-prediction

Gaurav Kumar Yadav 5 Nov 18, 2022
Main repository for Vispy

VisPy: interactive scientific visualization in Python Main website: http://vispy.org VisPy is a high-performance interactive 2D/3D data visualization

vispy 3k Jan 03, 2023
Small project to recursively calculate and plot each successive order of the Hilbert Curve

hilbert-curve Small project to recursively calculate and plot each successive order of the Hilbert Curve. After watching 3Blue1Brown's video on Hilber

Stefan Mejlgaard 2 Nov 15, 2021
A programming language built on top of Python to easily allow Swahili speakers to get started with programming without ever knowing English

pyswahili A programming language built over Python to easily allow swahili speakers to get started with programming without ever knowing english pyswa

Jordan Kalebu 72 Dec 15, 2022
Lightweight, extensible data validation library for Python

Cerberus Cerberus is a lightweight and extensible data validation library for Python. v = Validator({'name': {'type': 'string'}}) v.validate({

eve 2.9k Dec 27, 2022
An(other) implementation of JSON Schema for Python

jsonschema jsonschema is an implementation of JSON Schema for Python. from jsonschema import validate # A sample schema, like what we'd get f

Julian Berman 4k Jan 04, 2023
Python & Julia port of codes in excellent R books

X4DS This repo is a collection of Python & Julia port of codes in the following excellent R books: An Introduction to Statistical Learning (ISLR) Stat

Gitony 5 Jun 21, 2022
Colormaps for astronomers

cmastro: colormaps for astronomers 🔭 This package contains custom colormaps that have been used in various astronomical applications, similar to cmoc

Adrian Price-Whelan 12 Oct 11, 2022
A little word cloud generator in Python

Linux macOS Windows PyPI word_cloud A little word cloud generator in Python. Read more about it on the blog post or the website. The code is tested ag

Andreas Mueller 9.2k Dec 30, 2022
Ana's Portfolio

Ana's Portfolio ✌️ Welcome to my Portfolio! You will find here different Projects I have worked on (from scratch) 💪 Projects 💻 1️⃣ Hangman game (Mad

Ana Katherine Cortes Sobrino 9 Mar 15, 2022
Practical-statistics-for-data-scientists - Code repository for O'Reilly book

Code repository Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python by Peter Bruce, Andrew Bruce, and Peter Gedeck Pub

1.7k Jan 04, 2023
flask extension for integration with the awesome pydantic package

Flask-Pydantic Flask extension for integration of the awesome pydantic package with Flask. Installation python3 -m pip install Flask-Pydantic Basics v

249 Jan 06, 2023
基于python爬虫爬取COVID-19爆发开始至今全球疫情数据并利用Echarts对数据进行分析与多样化展示。

COVID-19-Epidemic-Map 基于python爬虫爬取COVID-19爆发开始至今全球疫情数据并利用Echarts对数据进行分析与多样化展示。 觉得项目还不错的话欢迎给一个star! 项目的源码可以正常运行,各个库的版本、数据库的建表语句、运行过程中遇到的坑以及解决方式在笔记.md中都

31 Dec 15, 2022
Bioinformatics tool for exploring RNA-Protein interactions

Explore RNA-Protein interactions. RNPFind is a bioinformatics tool. It takes an RNA transcript as input and gives a list of RNA binding protein (RBP)

Nahin Khan 3 Jan 27, 2022