List of Kedro Integrations

This is a list of platforms and tools that integrate with Kedro. This list is updated as of April 2025.

  • 1
    Vertex AI Reviews & Ratings

    Vertex AI

    Google

    Effortlessly build, deploy, and scale custom AI solutions.
    More Information
    Company Website
    Company Website
    Completely managed machine learning tools facilitate the rapid construction, deployment, and scaling of ML models tailored for various applications. Vertex AI Workbench seamlessly integrates with BigQuery Dataproc and Spark, enabling users to create and execute ML models directly within BigQuery using standard SQL queries or spreadsheets; alternatively, datasets can be exported from BigQuery to Vertex AI Workbench for model execution. Additionally, Vertex Data Labeling offers a solution for generating precise labels that enhance data collection accuracy. Furthermore, the Vertex AI Agent Builder allows developers to craft and launch sophisticated generative AI applications suitable for enterprise needs, supporting both no-code and code-based development. This versatility enables users to build AI agents by using natural language prompts or by connecting to frameworks like LangChain and LlamaIndex, thereby broadening the scope of AI application development.
  • 2
    Docker Reviews & Ratings

    Docker

    Docker

    Streamline development with portable, reliable containerized applications.
    Docker simplifies complex configuration tasks and is employed throughout the entire software development lifecycle, enabling rapid, straightforward, and portable application development on desktop and cloud environments. This comprehensive platform offers various features, including user interfaces, command-line utilities, application programming interfaces, and integrated security, which all work harmoniously to enhance the application delivery process. You can kickstart your programming projects by leveraging Docker images to create unique applications compatible with both Windows and Mac operating systems. With the capabilities of Docker Compose, constructing multi-container applications becomes a breeze. In addition, Docker seamlessly integrates with familiar tools in your development toolkit, such as Visual Studio Code, CircleCI, and GitHub, enhancing your workflow. You can easily package your applications into portable container images, guaranteeing consistent performance across diverse environments, whether on on-premises Kubernetes or cloud services like AWS ECS, Azure ACI, or Google GKE. Furthermore, Docker provides access to a rich repository of trusted assets, including official images and those from verified vendors, ensuring that your application development is both reliable and high-quality. Its adaptability and integration capabilities position Docker as an essential tool for developers striving to boost their productivity and streamline their processes, making it indispensable in modern software development. This ensures that developers can focus more on innovation and less on configuration management.
  • 3
    Jupyter Notebook Reviews & Ratings

    Jupyter Notebook

    Project Jupyter

    Empower your data journey with interactive, collaborative insights.
    Jupyter Notebook is a versatile, web-based open-source application that allows individuals to generate and share documents that include live code, visualizations, mathematical equations, and textual descriptions. Its wide-ranging applications include data cleaning, statistical modeling, numerical simulations, data visualization, and machine learning, highlighting its adaptability across different domains. Furthermore, it acts as a superb medium for collaboration and the exchange of ideas among professionals within the data science community, fostering innovation and collective learning. This collaborative aspect enhances its value, making it an essential tool for both beginners and experts alike.
  • 4
    Python Reviews & Ratings

    Python

    Python

    Unlock endless programming potential with a welcoming community.
    At the core of extensible programming is the concept of defining functions. Python facilitates this with mandatory and optional parameters, keyword arguments, and the capability to handle arbitrary lists of arguments. Whether you're a novice in programming or possess years of expertise, Python remains approachable and easy to grasp. This language is notably inviting for newcomers while still providing considerable depth for those experienced in other programming languages. The following sections lay a strong groundwork for anyone eager to start their Python programming adventure! The dynamic community actively organizes various conferences and meetups to foster collaborative coding and the exchange of ideas. Furthermore, the comprehensive documentation acts as an invaluable guide, while mailing lists help maintain user connections. The Python Package Index (PyPI) offers a wide selection of third-party modules that enhance the Python experience. With an extensive standard library alongside community-contributed modules, Python presents endless programming possibilities, making it an adaptable choice for developers at every skill level. Additionally, the thriving ecosystem encourages continuous learning and innovation among its users.
  • 5
    Plotly Dash Reviews & Ratings

    Plotly Dash

    Plotly

    Empower analytics with seamless web apps, no coding required.
    Dash and Dash Enterprise empower users to create and distribute analytic web applications utilizing Python, R, or Julia, eliminating the need for JavaScript or DevOps expertise. Leading companies worldwide leverage AI, machine learning, and Python analytics, achieving remarkable results at a significantly lower expense compared to traditional full-stack development. Dash serves as their solution. Applications and dashboards capable of executing sophisticated analyses, including natural language processing, forecasting, and computer vision, can be efficiently delivered. You have the flexibility to work in Python, R, or Julia, and by transitioning from outdated per-seat license software to Dash Enterprise's unlimited end-user pricing model, you can significantly cut costs. Dash enables rapid deployment and updates of applications without requiring a dedicated IT or DevOps team. Furthermore, you can design visually stunning web apps and dashboards without any need for CSS coding. Kubernetes simplifies scaling processes, and the platform also ensures high availability for essential Python applications, making it an ideal choice for businesses looking to enhance their analytical capabilities. Overall, Dash and Dash Enterprise revolutionize the way organizations approach analytics and application development.
  • 6
    pandas Reviews & Ratings

    pandas

    pandas

    Powerful data analysis made simple and efficient for everyone.
    Pandas is a versatile open-source library for data analysis and manipulation that excels in speed and power while maintaining a user-friendly interface within the Python ecosystem. It supports a wide range of data formats for both importing and exporting, such as CSV, text documents, Microsoft Excel, SQL databases, and the efficient HDF5 format. The library stands out with its intelligent data alignment features and its adept handling of missing values, allowing for seamless label-based alignment during calculations, which greatly aids in the organization of chaotic datasets. Moreover, pandas includes a sophisticated group-by engine that facilitates complex aggregation and transformation tasks, making it simple for users to execute split-apply-combine operations on their data. In addition to these capabilities, pandas is equipped with extensive time series functions that allow for the creation of date ranges, frequency conversions, and moving window statistics, as well as managing date shifting and lagging. Users also have the flexibility to define custom time offsets for specific applications and merge time series data without losing any critical information. Ultimately, the comprehensive array of features offered by pandas solidifies its status as an indispensable resource for data professionals utilizing Python, ensuring they can efficiently handle a diverse range of data-related tasks.
  • 7
    Amazon SageMaker Reviews & Ratings

    Amazon SageMaker

    Amazon

    Empower your AI journey with seamless model development solutions.
    Amazon SageMaker is a robust platform designed to help developers efficiently build, train, and deploy machine learning models. It unites a wide range of tools in a single, integrated environment that accelerates the creation and deployment of both traditional machine learning models and generative AI applications. SageMaker enables seamless data access from diverse sources like Amazon S3 data lakes, Redshift data warehouses, and third-party databases, while offering secure, real-time data processing. The platform provides specialized features for AI use cases, including generative AI, and tools for model training, fine-tuning, and deployment at scale. It also supports enterprise-level security with fine-grained access controls, ensuring compliance and transparency throughout the AI lifecycle. By offering a unified studio for collaboration, SageMaker improves teamwork and productivity. Its comprehensive approach to governance, data management, and model monitoring gives users full confidence in their AI projects.
  • 8
    Matplotlib Reviews & Ratings

    Matplotlib

    Matplotlib

    Create stunning static and interactive visualizations effortlessly!
    Matplotlib is a flexible library that facilitates the creation of static, animated, and interactive graphs in Python. It not only makes it easy to generate simple plots but also supports the development of intricate visualizations. A wide range of third-party extensions further amplifies Matplotlib's functionality, offering sophisticated plotting interfaces like Seaborn, HoloViews, and ggplot, as well as mapping and projection tools such as Cartopy. This rich ecosystem empowers users to customize their visual outputs according to individual requirements and tastes. Additionally, the continuous growth of the community around Matplotlib ensures that innovative features and improvements are regularly introduced, enhancing the overall user experience.
  • 9
    Azure Machine Learning Reviews & Ratings

    Azure Machine Learning

    Microsoft

    Streamline your machine learning journey with innovative, secure tools.
    Optimize the complete machine learning process from inception to execution. Empower developers and data scientists with a variety of efficient tools to quickly build, train, and deploy machine learning models. Accelerate time-to-market and improve team collaboration through superior MLOps that function similarly to DevOps but focus specifically on machine learning. Encourage innovation on a secure platform that emphasizes responsible machine learning principles. Address the needs of all experience levels by providing both code-centric methods and intuitive drag-and-drop interfaces, in addition to automated machine learning solutions. Utilize robust MLOps features that integrate smoothly with existing DevOps practices, ensuring a comprehensive management of the entire ML lifecycle. Promote responsible practices by guaranteeing model interpretability and fairness, protecting data with differential privacy and confidential computing, while also maintaining a structured oversight of the ML lifecycle through audit trails and datasheets. Moreover, extend exceptional support for a wide range of open-source frameworks and programming languages, such as MLflow, Kubeflow, ONNX, PyTorch, TensorFlow, Python, and R, facilitating the adoption of best practices in machine learning initiatives. By harnessing these capabilities, organizations can significantly boost their operational efficiency and foster innovation more effectively. This not only enhances productivity but also ensures that teams can navigate the complexities of machine learning with confidence.
  • 10
    Apache Spark Reviews & Ratings

    Apache Spark

    Apache Software Foundation

    Transform your data processing with powerful, versatile analytics.
    Apache Spark™ is a powerful analytics platform crafted for large-scale data processing endeavors. It excels in both batch and streaming tasks by employing an advanced Directed Acyclic Graph (DAG) scheduler, a highly effective query optimizer, and a streamlined physical execution engine. With more than 80 high-level operators at its disposal, Spark greatly facilitates the creation of parallel applications. Users can engage with the framework through a variety of shells, including Scala, Python, R, and SQL. Spark also boasts a rich ecosystem of libraries—such as SQL and DataFrames, MLlib for machine learning, GraphX for graph analysis, and Spark Streaming for processing real-time data—which can be effortlessly woven together in a single application. This platform's versatility allows it to operate across different environments, including Hadoop, Apache Mesos, Kubernetes, standalone systems, or cloud platforms. Additionally, it can interface with numerous data sources, granting access to information stored in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and many other systems, thereby offering the flexibility to accommodate a wide range of data processing requirements. Such a comprehensive array of functionalities makes Spark a vital resource for both data engineers and analysts, who rely on it for efficient data management and analysis. The combination of its capabilities ensures that users can tackle complex data challenges with greater ease and speed.
  • 11
    MLflow Reviews & Ratings

    MLflow

    MLflow

    Streamline your machine learning journey with effortless collaboration.
    MLflow is a comprehensive open-source platform aimed at managing the entire machine learning lifecycle, which includes experimentation, reproducibility, deployment, and a centralized model registry. This suite consists of four core components that streamline various functions: tracking and analyzing experiments related to code, data, configurations, and results; packaging data science code to maintain consistency across different environments; deploying machine learning models in diverse serving scenarios; and maintaining a centralized repository for storing, annotating, discovering, and managing models. Notably, the MLflow Tracking component offers both an API and a user interface for recording critical elements such as parameters, code versions, metrics, and output files generated during machine learning execution, which facilitates subsequent result visualization. It supports logging and querying experiments through multiple interfaces, including Python, REST, R API, and Java API. In addition, an MLflow Project provides a systematic approach to organizing data science code, ensuring it can be effortlessly reused and reproduced while adhering to established conventions. The Projects component is further enhanced with an API and command-line tools tailored for the efficient execution of these projects. As a whole, MLflow significantly simplifies the management of machine learning workflows, fostering enhanced collaboration and iteration among teams working on their models. This streamlined approach not only boosts productivity but also encourages innovation in machine learning practices.
  • 12
    Azure Databricks Reviews & Ratings

    Azure Databricks

    Microsoft

    Unlock insights and streamline collaboration with powerful analytics.
    Leverage your data to uncover meaningful insights and develop AI solutions with Azure Databricks, a platform that enables you to set up your Apache Spark™ environment in mere minutes, automatically scale resources, and collaborate on projects through an interactive workspace. Supporting a range of programming languages, including Python, Scala, R, Java, and SQL, Azure Databricks also accommodates popular data science frameworks and libraries such as TensorFlow, PyTorch, and scikit-learn, ensuring versatility in your development process. You benefit from access to the most recent versions of Apache Spark, facilitating seamless integration with open-source libraries and tools. The ability to rapidly deploy clusters allows for development within a fully managed Apache Spark environment, leveraging Azure's expansive global infrastructure for enhanced reliability and availability. Clusters are optimized and configured automatically, providing high performance without the need for constant oversight. Features like autoscaling and auto-termination contribute to a lower total cost of ownership (TCO), making it an advantageous option for enterprises aiming to improve operational efficiency. Furthermore, the platform’s collaborative capabilities empower teams to engage simultaneously, driving innovation and speeding up project completion times. As a result, Azure Databricks not only simplifies the process of data analysis but also enhances teamwork and productivity across the board.
  • 13
    Kubeflow Reviews & Ratings

    Kubeflow

    Kubeflow

    Streamline machine learning workflows with scalable, user-friendly deployment.
    The Kubeflow project is designed to streamline the deployment of machine learning workflows on Kubernetes, making them both scalable and easily portable. Instead of replicating existing services, we concentrate on providing a user-friendly platform for deploying leading open-source ML frameworks across diverse infrastructures. Kubeflow is built to function effortlessly in any environment that supports Kubernetes. One of its standout features is a dedicated operator for TensorFlow training jobs, which greatly enhances the training of machine learning models, especially in handling distributed TensorFlow tasks. Users have the flexibility to adjust the training controller to leverage either CPUs or GPUs, catering to various cluster setups. Furthermore, Kubeflow enables users to create and manage interactive Jupyter notebooks, which allows for customized deployments and resource management tailored to specific data science projects. Before moving workflows to a cloud setting, users can test and refine their processes locally, ensuring a smoother transition. This adaptability not only speeds up the iteration process for data scientists but also guarantees that the models developed are both resilient and production-ready, ultimately enhancing the overall efficiency of machine learning projects. Additionally, the integration of these features into a single platform significantly reduces the complexity associated with managing multiple tools.
  • 14
    Dask Reviews & Ratings

    Dask

    Dask

    Empower your computations with seamless scaling and flexibility.
    Dask is an open-source library that is freely accessible and developed through collaboration with various community efforts like NumPy, pandas, and scikit-learn. It utilizes the established Python APIs and data structures, enabling users to move smoothly between the standard libraries and their Dask-augmented counterparts. The library's schedulers are designed to scale effectively across large clusters containing thousands of nodes, and its algorithms have been tested on some of the world’s most powerful supercomputers. Nevertheless, users do not need access to expansive clusters to get started, as Dask also includes schedulers that are optimized for personal computing setups. Many users find value in Dask for improving computation performance on their personal laptops, taking advantage of multiple CPU cores while also using disk space for extra storage. Additionally, Dask offers lower-level APIs that allow developers to build customized systems tailored to specific needs. This capability is especially advantageous for innovators in the open-source community aiming to parallelize their applications, as well as for business leaders who want to scale their innovative business models effectively. Ultimately, Dask acts as a flexible tool that effectively connects straightforward local computations with intricate distributed processing requirements, making it a valuable asset for a wide range of users.
  • 15
    Apache Airflow Reviews & Ratings

    Apache Airflow

    The Apache Software Foundation

    Effortlessly create, manage, and scale your workflows!
    Airflow is an open-source platform that facilitates the programmatic design, scheduling, and oversight of workflows, driven by community contributions. Its architecture is designed for flexibility and utilizes a message queue system, allowing for an expandable number of workers to be managed efficiently. Capable of infinite scalability, Airflow enables the creation of pipelines using Python, making it possible to generate workflows dynamically. This dynamic generation empowers developers to produce workflows on demand through their code. Users can easily define custom operators and enhance libraries to fit the specific abstraction levels they require, ensuring a tailored experience. The straightforward design of Airflow pipelines incorporates essential parametrization features through the advanced Jinja templating engine. The era of complex command-line instructions and intricate XML configurations is behind us! Instead, Airflow leverages standard Python functionalities for workflow construction, including date and time formatting for scheduling and loops that facilitate dynamic task generation. This approach guarantees maximum flexibility in workflow design. Additionally, Airflow’s adaptability makes it a prime candidate for a wide range of applications across different sectors, underscoring its versatility in meeting diverse business needs. Furthermore, the supportive community surrounding Airflow continually contributes to its evolution and improvement, making it an ever-evolving tool for modern workflow management.
  • Previous
  • You're on page 1
  • Next