AZURE SYNAPSE ANALYTICS – PART 1

What is Azure Synapse Analytics?

      • It is an enterprise analytics service

      • You can quickly get insights across data warehouses and big data systems
            • You can use:
              • SQL for data warehousing
              • Spark for big data
              • Data Explorer for log and time series analytics
              • Pipelines for data integration and ETL/ELT
              • Integration with Power BI, Cosmos DB and AzureML

       Synapse SQL

          • Distributed query system for T-SQL

          • Ideal for:
            • Data warehousing
            • Data virtualization
            • Streaming
            • Machine Learning

          • Offers both serverless and dedicated resources models

          • Dedicated SQL pools work best for predictable performance and cost

          • Serverless SQL endpoints are suitable for unplanned or bursty workloads

          • Land streaming data from cloud data sources into SQL tables using built-in streaming capabilities

          • Integrate AI with SQL using machine learning models

        Apache Spark for Azure Synapse

          • Most popular open-source big data engine used for:
            • Data Preparation
            • Data Engineering
            • ELT
            • Machine Learning

          • Free from managing clusters

          • Fast Spark start-up

          • Aggressive autoscaling

          • Built-in support for .NET for Spark

        Data Lake

          • With Azure Synapse, SQL and Spark can be used together

          • SQL and Spark can directly explore and analyze Parquet, CSV, TSC, and JSON files in the data lake

          • Fast, scalable data loading between SQL and Spark databases

        Built-in Data Integration

          • The data integration engine and experiences are the same as Azure Data Factory

          • You can create rich at-scale ETL pipelines without leaving Azure Synapse Analytics

          • Ingest data from 90+ data sources

          • Code-Free ETL with Data flow activities

          • Orchestrate notebooks, Spark jobs, stored procedures, SQL scripts and more

        Data Explorer

          • Customers can interactively query to unlock insights from log and telemetry data

          • Data Explorer analytics runtime is optimized for efficient log analytics using powerful indexing technology to automatically index free-text and semi-structured data commonly found in the telemetry data

          • Ideal for building near real-time log analytics and IOT analytics solutions

          • Logs and event data across on-premises, cloud, and third-party data sources can be all consolidated and correlated

          • Move faster with your AI Ops journey in pattern recognition, anomaly detection, forecasting and more

          • Save cost and increase productivity by replacing infrastructure-based log search

          • Build IoT Analytics solution for your IoT data

          • Build Analytical SaaS solutions

        Unified Experience

          • You can build solutions, maintain, and secure all in a single user experience through Synapse Studio

          • Perform key tasks: ingest, explore, prepare, orchestrate, visualize

          • Monitor resources, usage, and users across SQL, Spark and Data Explorer

          • Use -Role-based access control to simplify access to analytics resources

          • Write SQL, Spark or KQL code and integrate with enterprise CI/CD processes

        ITECHSTORECA

        FOR ALL YOUR TECH SOLUTIONS