FLaNK-AIM Weekly 06 May 2024

Published at
5/6/2024
Categories
apachenifi
apachekafka
apacheflink
opensource
Author
tspannhw

06-May-2024

https://www.youtube.com/@FLaNK-Stack

FLaNK / KNIFe AI / FLaNK-AIM Weekly

http://knifeai.org/

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup groups: NJ, NYC, Philly, and virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

This is Issue #136

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

New Releases

Articles

https://medium.com/@tspann/small-language-models-sml-for-the-win-ea0c6fee8061

https://medium.com/@tspann/maybe-four-smaller-open-llm-s-are-better-than-one-93f78fb69eb9

https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa

https://medium.com/@tspann/searching-slack-from-apache-nifi-9ed562aa2397

https://medium.com/@tspann/events-streams-flows-and-maps-22a8d27cd9b4

https://medium.com/@tspann/storing-meetup-user-data-as-events-dad3b1dc89f5

https://medium.com/@tspann/real-time-in-boston-part-1-0f92d7da3496

NSA AI Security
https://www.nsa.gov/Press-Room/Press-Releases-Statements/Press-Release-View/Article/3741371/nsa-publishes-guidance-for-strengthening-ai-system-security/

https://zilliz.com/learn/Sentence-Transformers-for-Long-Form-Text

https://zilliz.com/zilliz-cloud-pipelines

https://huggingface.co/BAAI/bge-large-en-v1.5

https://github.com/cloudevents/sdk-python/blob/main/samples/http-json-cloudevents/client.py

https://docs.cloudera.com/machine-learning/cloud/release-notes/topics/ml-whats-new.html#ml_workspace_resource_tags

https://zilliz.com/blog/finding-right-fit-embedding-support-for-RAG-in-zilliz-cloud-pipelines-from-voyageai-openai-and-oss

https://hazelcast.com/glossary/streaming-data/

https://postgres.ai/blog/20220525-common-db-schema-change-mistakes

https://medium.com/cloudera-inc/consuming-rss-feeds-from-flink-sql-eaf33c1a5a23

https://medium.com/cloudera-inc/adding-generative-ai-results-to-sql-streams-513e1fd2a6af

https://www.linuxfoundation.org/press/lf-ai-data-foundation-launches-open-platform-for-enterprise-ai-opea

https://blog.mozilla.ai/local-llm-as-judge-evaluation-with-lm-buddy-prometheus-and-llamafile/

https://blog.mozilla.ai/open-source-in-the-age-of-llms/

https://www.pinecone.io/learn/structured-data/

https://medium.com/@stoty/a-bug-for-ages-fixing-time-zone-handling-in-apache-phoenix-e9934d7acd80

https://www.geeknarrator.com/blog/stream-processing/stream-processing-concepts

https://blog.cloudera.com/setting-up-and-getting-started-with-clouderas-new-sql-ai-assistant/

https://thenewstack.io/how-to-cure-llm-weaknesses-with-vector-databases/

https://dev.to/zilliz/exploring-bge-m3-and-splade-two-machine-learning-models-for-generating-sparse-embeddings-22p1

https://zilliz.com/learn/transforming-pdfs-into-insights-vectorizing-and-ingesting-with-zilliz-cloud-pipelines

https://zilliz.com/blog/how-to-evaluate-and-optimize-performance-of-milvus-storage

https://datavolo.io/2024/05/apache-nifi-designed-for-extension-at-scale/

Videos

Generative AI with Milvus
https://www.youtube.com/watch?v=IfWIzKsoHnA

Four Models at Once
https://youtu.be/xvNgsZyfo6A?si=zxwc9VcFc3o0vU3P

Search Slack
https://www.youtube.com/watch?v=3ugppfb2kN8&t=5s&ab_channel=DatainMotion-HowToBeaStreamingEngineer

MBTA Transit Live with LLM
https://www.youtube.com/watch?v=JGGY_uzQTdY&t=3s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D

Events, Streams, Maps with Irish Rail
https://www.youtube.com/watch?v=14CSQRfUWoE&t=684s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D

FLaNK AI Channel
https://www.youtube.com/@FLaNK-Stack

NiFi
https://www.youtube.com/watch?v=m-ZoqHOYy_k

Slides

https://www.slideshare.net/slideshow/generative-ai-on-enterprise-cloud-with-nifi-and-milvus/267678399

https://www.slideshare.net/slideshow/conf42llmadding-generative-ai-to-realtime-streaming-pipelines/267269788

https://github.com/tspannhw/FLaNK-Milvus

https://medium.com/cloudera-inc/building-a-milvus-connector-for-nifi-34372cb3c7fa

https://www.youtube.com/watch?v=ssoM5S87BBs

Events

May 8-9, 2024: Data Summit 2024. Boston, MA.
https://www.dbta.com/DataSummit/2024/default.aspx
https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx

https://twitter.com/DBTADataSummit/status/1778393005646397636

May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.

May 30, 2024: Conf42: Machine Learning.
https://www.conf42.com/Machine_Learning_2024_Tim_Spann_enriching_generative_events

June 12, 2024: Budapest Data + ML Forum. Virtual.
https://budapestml.hu/2024/en/speakers/

June 20, 2024: AI Camp Meetup. NYC.

Sept 24, 2024: JConf.Dev. Dallas.
https://2024.jconf.dev/session/598816

Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara.
https://www.developerweek.com/cloudx/

Nov 19, 2024: XtremePython. Online.
https://xtremepython.dev/2024/

Cloudera Events
https://www.cloudera.com/about/events.html

https://www.cloudera.com/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY25-Q1-AMER-WS-Cloudera-Now-Events-Page-P06&cid=701Hr000000tW6qIAE&internal_link=p06

More Events:
https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

Models

Tools

Cool Tool

Convert Spark SQL to Trino SQL
https://github.com/linkedin/coral

Discount

Discounted access to Data Summit 2024
https://secure.infotoday.com/RegForms/DataSummit/?Priority=24SPKR

© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack
FLaNK-AIM with LLAMA 3
