BDB Release Notes

Product Release Notes

3.7

October 10th, 2018

Platform

Key Features:

License-Key implementation
1. Active user controlling
2. Session Control
Data Center
1. Data Service user property-based filter from the backend for RDBMS
2. Data Store refresh: Load balancing
Backend service to authorize a single device is provided for mobile app.
User Management
1. Bulk user creation using Excel upload
Language settings support
OpenDoc link for Story can be shared with users with permission to modify it
All BDB spaces reflect sample contents by default
My Account- Mobile device removal option is provided
Migration- Dashboards created based on data store service can be migrated now
Data Connectors
1. Twitter Ads
2. Google Forms
3. LinkedIn Ads
4. Postgre SQL

Enhancements:

Data Center
1. Data Store/Meta Data: LOV and Lookup Definition is provided
Shared Folder link can be accessed now from the My Documents space
JWT Token size reduction
UI improvements for enhanced user experience

Data Preparation

Key Features:

Filter on frequency chart is added with ‘OR’ condition
The ‘Search’ and ‘Sort’ are provided on frequency charts
Transforms

Date Transforms
1. Add Interval to Date
2. Extract Date
3. Find Date Difference
4. Sub-Interval to Date
Substring Extractions
1. Extract Substring at Position
2. Extract a Substring before Delimiter

Enhancements:

Performance improvements in the load of cleansing
1. Irrespective of the dataset size the UI will open in seconds

Note: The ‘Data Cleansing’ tool is in Beta.

Data Pipeline

Key Features:

BDB Data Pipeline is introduced to speed up your development by providing an easy to use framework for working with batch and streaming data inside your application. The BDB Data Pipeline contains various data readers, writers, and ingestion API for a variety of data sources and formats, along with the support of streaming data. Our pipeline can replace batch jobs with real-time data and prepare data for in-depth analysis and instant visualization.

The BDB Data Pipeline framework supports the following list of features:

Data Readers
1. SFTP
2. HDFS
3. Cassandra
4. JDBC (MYSQL, MSSQL, ORACLE, POSTGRE)
5. Elastic Search
Data Writers
1. HDFS
2. Cassandra
3. JDBC (MYSQL, MSSQL, ORACLE, POSTGRE)
4. Elastic Search
Transformation
1. Aggregation
2. Date Formatter
3. Split
4. Replace Text
5. Join
6. SQL Query
Model Runners
1. R Model
2. Spark Model
Ingestion
1. SFTP Monitoring
2. Web Socket Listener
3. Sqoop Job
Web Socket Broadcast
Custom Component Support
Job Deployment Processes/Type
1. Streamed
2. Invoked
Dynamic/Runtime update can be provided to the components
Data-lake
1. At present HDFS is supported with various file formats such as CSV, Parquet, JSON, Avro
PEM/PPK Support

Business Story

New Features:

Aggregated formulas
Charting Components
1. Pareto
2. Scatter Plot
Legend checkbox
The running summary is provided now to view the components process
All the applied filters can be displayed for a view/story
UI Enhancements
1. Menu bar customization
2. The charting theme as per dashboard designer
3. Font size standardization
Property panel enhancements for charting components to support new properties
NLP Features
1. Additional Statistics are provided in NLP UI (as KPI Tiles) which can be added to a story
2. Improved Date support in NLP (like weekly, quarterly and monthly)
3. NLP Search or Date as a dimension
4. Support for multiple measures
5. NLP Tree-map for two-dimensional data

Enhancements:

Filter Panel Improvements
1. Option in Data store/metadata to enable filter and enable lookup
2. Max Selection to be restricted to 10
3. Extended Date support is provided in the filter
4. Like and equal operation in the filter
NLP
1. Enhanced Top and Bottom keyword support
2. Enhanced date support in the filter

Dashboard Designer

Charting

New Features:

Leaflet Map
1. Marker clustering to group markers is provided when zooming-out
2. Polygon fill view is added
Spider & Circumplex charts are converted to SVG and empowered with animation
All the charting components now support language mapping.
Custom charts: Pre-scripted D3 Dual Axis Bar and D3 TreeMap are added in the component shelf
WorldMap and TreeMap charts have the option to configure custom tooltip
Custom Tooltip supports HTML tag in the tooltip which can be used to embed Image/Video
Bar &Timeline charts have a maximum size of the stack for uniform view with varying number of categories.
Inverted Funnel is provided with the option to control stack height and border properties.
Title and Axis descriptions for charts can be wrapped in multiple lines.
Repeater components can have common axis marking across the charts.
Scatter Plot chart has an option to plot the best file line for the given dataset.

Enhancements:

Timeline:
1. Category and conditional indicators are applicable at the same time
2. X-Axis markers can wrap the text in 2 or 3 lines
3. Border radius is provided on stacks
4. Chart padding and spacing are controllable from scripting
Data Store Connector:
1. Multiple Filter parameters can be passed in DataService call.
2. Conditional indicators can be configured with DS fields.
Filter:
1. Configuration option for browser standard drop-down or dropdown with search in options.
2. Control submission of filter change when none of the options are selected.
3. Hierarchical filter and list filter: option to select multiple indexes as the default selection.
Export PDF: Grid can be directly exported in PDF

Designer

New Features

Charts: Legend font size, tooltip font size, and background properties are available.
Theme: Dark and Material themes are available
Predictive Connector: Summary and Bokeh visual from R workflows can be displayed in the dashboard.

Enhancements

SDK Methods:

startDashboardTour: To configure the guided tour inside the dashboard
injectCSSRules: Inject CSS rule to override any existing style
setStatusMessage: To show a popover message which hides after a given timeout

Predictive Workbench

New Features:

User Interface in the NN model for training and re-training
Migration tool for PA
Upgrade Migration
Exporting trained models to Data Pipeline
AutoML
Refactoring and restructuring
Predefined Scripts
1. The predefined scripts provided in the R Workspace
  1. Weighted Least Squares Regression (WLS relative Std, WLSR Input Weights)
  2. Fast Forest Quantile Regression
  3. Singular Value Decomposition
  4. Linear Discriminant Analysis [LDA Count, LDA Feature Select]
  5. Bayesian Linear Regression
  6. Hierarchical Clustering
  7. Stepwise Regression
  8. Ordinal Regression
  9. Isolation Forest
  10. Factor Analysis
  11. Optimal K Value
  12. EM algorithm
  13. Elastic Net
  14. K-Means++
  15. Boosting
  16. ADA Boost
  17. Bagging
  18. K-NN
  19. GBM
  20. PCA
2. The predefined scripts provided in the Python Workspace
  1. Auto ML
  2. Naive Bayes Classification
  3. Random Forest
  4. Hierarchical Clustering
  5. K-NN
  6. LDA Feature Selection
  7. LDA Prediction
  8. PCA
  9. Decision Tree
  10. Gradient Boosting Model
  11. XG Boost
  12. ADA Boost
  13. Extremely Randomized Trees
  14. K means

Enhancements:

A ‘Reset’ button is provided in the NN Workspace.
The data management limit will not be applicable for Python and Spark data query.
The ‘Settings’ tab changes are provided for the Custom Scripts
Custom python scripts now support without dynamic fields.
UI validation in the Apply Model for algorithm/data preparation/model reader component
R working directory is taken off from the Scheduler configurations in Admin settings.
Mechanism to remove the related scripts is introduced if the NN model directory is deleted.
The workflow status is updated in DB instead of removing it for the deleted workflows service.

Notes:The following performance tests are successfully conducted on the Predictive Workbench R-3.7 environment.

Executed Random forest with PySpark for 1B rows.
Python SGD Regression with Python for 50M rows.

Mobile Apps

New Features:

Single Device Setup is introduced to restrict the use of multiple mobile devices for a single account
NLP Display
1. Additional stats in NLP output (KPI Tiles)
Wiki Microblogging has been provided for stories
Users can share open document link via email
User Management Settings: The ‘Change Password’ option is provided

Enhancements:

UI enhancements in NLP to support inter switching of charts

Recommendations

Product	Description
Data Pipeline	Following are the basic requirements to deploy Data Pipeline on-premises: Kubernetes Master – 8 GB RAM 4 Core Kubernetes Slave – 32 GB RAM 4 Core (Min: 3 Slaves) Kubernetes 1.9 or Higher
Predictive Workbench
	The user must update the compile method present in the script after a model is structured through the NN User Interface.
	The SFTP configuration should be similar to the export user and the import user.
Dashboard Designer
	Wrap field names in the square bracket when used in calculated fields or in dataset filter script, which will improve data processing and allow to use fields with special characters.
	Make sure that ‘sync’ property is turned off in the Preference menu when script written on labels for creating a mobile view is required to be different from the desktop mode.
	Do not override any charting/framework method to make your script work for the temporary purpose. It may break your dashboard in future when a new version of BDB DD is released.
Data Preparation	Use data cleansing with ETL for datasets with more than 10K rows.

Known Limitations:

Product	Description
Data Pipeline
	Kafka Offset manual commits are not supported for dynamic/runtime update of components. No scheduler support for components.
	Stopping components in running pipeline is not supported.
	No Kerberos support is provided for Kafka and HDFS data.
	WebSocket producer Ingress for dynamic URI creation is not supported.
	The ‘Encryption’ feature is not supported.
Predictive Workbench
	All Keras Layers are not present in NN User Interface, though users can add those in Scripting Part.
	The target user should have the same model folder name while migrating the NumPy scripts.
	NN workflows do not support in dashboards.
	Due to UI changes the PA migration module does not work in IE.
	Not able to import BAF file for multiple users at a time from non-admin users.
	Date records mentioned with ‘AM’ or ‘PM’ cannot write into Database.
Dashboard Designer
	Export to PPT/PDF/PNG does not support google fonts, so there may be a mismatch on what you see on the browser and what is exported on PDF/PPT/PNG.
	The ‘Export’ functionality can take 5-15 seconds in chrome and ~30 seconds in IE based on the size of the dashboard.
	IE browser does not support Google fonts such as Roboto, Raleway, etc.
	Data Grid should not contain more than 500 records to avoid slow loading of the dashboard.
	Leaflet and Map components do not support the ‘Export’ option.
	Dashboard Designer does not support the Excel/CSV file of more than 3MB.
Data Preparation
	In place editing is yet to be implemented in the Grid.
	Reordering and deletion of steps are not implemented yet.
Business Story
	Date Filter: Relative search does not work with ‘Days’ as the value
	Aggregation as “None’ does not work with data store merge and measure filter
	Dimension LOV filter: Value is case sensitive
	Aggregated Formula does not work with measures
	Data Label position does not work with middle and bottom options
	The ‘Sort’ option does not work while dropping more than one dimensions

Connect with BDB Expert

Connect Now