GenAI

How AppFolio transformed property management workflows with Realm-X, built using LangGraph and LangSmith

See how AppFolio's AI-powered copilot Realm-X has saved property managers over 10 hours per week. Learn how they improved Realm-X's performance 2x using LangSmith and built an agent architecture with LangGraph.

Example #ai-copilot

·blog.langchain.dev·Dec 16, 2024

Running Neo4j’s LLM Graph Builder with Flox

Neo4j’s LLM Graph Builder is an app for automatically constructing knowledge graphs from unstructured data sources. It can be run locally…

Tutorial #neo4j

·medium.com·Dec 16, 2024

Can LLMs Convert Graphs to Text-Attributed Graphs?

Graphs are ubiquitous data structures found in numerous real-world applications, such as drug discovery, recommender systems, and social network analysis. Graph neural networks (GNNs) have become...

Paper

·arxiv.org·Dec 16, 2024

Evaluating Quality in Large Language Models: A Comprehensive Approach using the legal industry as a…

Evaluating the quality of outputs from Large Language Models (LLMs) is an intricate task due to the open-ended nature of many LLM tasks…

Concept

·medium.com·Dec 16, 2024

This weekend learn how to build a legal document agent from scratch 👨‍⚖️📑

I made a tutorial showing you how to build a contract review agentic workflow - given a vendor agreement, parse it into a set of key clauses, match it with relevant clauses from a set of guidelines (GDPR),… — Jerry Liu (@jerryjliu0)

Tutorial

·x.com·Dec 15, 2024

Practical Text-to-SQL for Data Analytics

Example #text-to-sql

·linkedin.com·Dec 15, 2024

The Problem with Reasoners

Concept

·aidanmclaughlin.notion.site·Dec 11, 2024

From PDFs to AI-ready structured data: a deep dive · Explosion

This blog post presents a new modular workflow for converting PDFs and similar documents to structured data and shows you how to build end-to-end document understanding and information extraction pipelines for industry use cases.

Deep Dive #document-understanding

·explosion.ai·Dec 11, 2024

How to Count Tokens - Tokenization With Tiktoken.

Counting tokens is a useful task in natural language processing (NLP) that allows us to measure the length and complexity of a text. Two important use cases for counting tokens are: controlling the length of the prompt - models have a limit …

Tutorial

·safjan.com·Dec 11, 2024

In context scheming reasoning paper

Security #explainability #interpretability

·static1.squarespace.com·Dec 11, 2024

Struggling to keep up with new RAG variants?

Here’s a cheat sheet of 7 of the most popular RAG architectures. Which variants did we miss? — Weaviate • vector database (@weaviate_io)

Concept #knowledge-graph

·x.com·Dec 10, 2024

GraphRAG in Action: From Commercial Contracts to a Dynamic Q&A Agent

A question-based extraction approach

Design Pattern #knowledge-graph

·towardsdatascience.com·Dec 10, 2024

LangChain Neo4j Integration - Neo4j Labs

Awesome guide with templates

Package #knowledge-graph #langchain #neo4j

·neo4j.com·Dec 10, 2024

A Multi-Agent Framework for Synthetic Data Generation

Presents MAG-V, a multi-agent framework that first generates a dataset of questions that mimic customer queries. It then reverse engineers alternate questions from responses to verify agent trajectories. Reports that the… — elvis (@omarsar0)

Concept

·x.com·Dec 9, 2024

Agentless is a great example of how a more constrained agent is better than a general agent for specific tasks 💡 - it achieves much higher scores on SWE-Bench Lite for bug-fixing than other agent approaches 🛠️

The whole point is to not let the agent do everything, but to do a… — Jerry Liu (@jerryjliu0)

Concept

·x.com·Dec 9, 2024

Pedro Domingos on X: "Calling an LLM an agent doesn’t suddenly make it more intelligent."

— Pedro Domingos (@pmddomingos)

Concept

·x.com·Dec 6, 2024

ZenML - LLMOps Database

List of solutions

Handbooks

·zenml.io·Dec 4, 2024

nategro (Nathanael)

User profile of Nathanael on Hugging Face

Package

·huggingface.co·Dec 2, 2024

PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance...

This study provides an efficient approach for using text data to calculate patent-to-patent (p2p) technological similarity, and presents a hybrid framework for leveraging the resulting p2p...

Paper

·arxiv.org·Dec 2, 2024

TRIZ Technical Contradiction Extraction Method Based on Patent Semantic Space Mapping | Proceedings of the 2020 11th International Conference on E-business, Management and Economics

Paper

·dl.acm.org·Dec 2, 2024

A Hierarchical Feature Extraction Model for Multi-Label Mechanical Patent Classification

Various studies have focused on feature extraction methods for automatic patent classification in recent years. However, most of these approaches are based on the knowledge from experts in related domains. Here we propose a hierarchical feature extraction model (HFEM) for multi-label mechanical patent classification, which is able to capture both local features of phrases as well as global and temporal semantics. First, an n-gram feature extractor based on convolutional neural networks (CNNs) is designed to extract salient local lexical-level features. Next, a long dependency feature extraction model based on the bidirectional long–short-term memory (BiLSTM) neural network model is proposed to capture sequential correlations from higher-level sequence representations. Then the HFEM algorithm and its hierarchical feature extraction architecture are detailed. We establish the training, validation and test datasets, containing 72,532, 18,133, and 2,679 mechanical patent documents, respectively, and then check the performance of HFEMs. Finally, we compared the results of the proposed HFEM and three other single neural network models, namely CNN, long–short-term memory (LSTM), and BiLSTM. The experimental results indicate that our proposed HFEM outperforms the other compared models in both precision and recall.

Paper

·mdpi.com·Dec 2, 2024

DAIR.AI

Learn important prompt engineering techniques to build use cases with LLMs.

Tutorial

·dair-ai.thinkific.com·Dec 2, 2024

INCOSE Guide to Writing Requirements V4 – Summary Sheet

Concept

·incose.org·Nov 30, 2024

LLM-based Extraction of Contradictions from Patents

Paper

·arxiv.org·Nov 30, 2024

Vector Similarity: Going Beyond Full-Text Search | Qdrant - Qdrant

Discover how vector similarity expands data exploration beyond full-text search. Explore diversity sampling and more for enhanced data discovery!

Concept #vector-similarity

·qdrant.tech·Nov 28, 2024

nategro/contradiction-psb · Hugging Face

Package

·huggingface.co·Nov 28, 2024

Fundamental Research on Detecting Contradictions in Requirements: Taxonomy and Semi-Automated Approach

Requirements documents can contain several thousand individual requirements. They must be error-free to avoid unnecessary complications and costs in the later product development stages. An important part of this is to identify contradictions between two requirements. The first step is therefore to define what contradictions are and in what form they can occur in requirement documents. In this paper the scientific theories regarding contradictions are discussed with regard to their usefulness for the topic. In doing so, the Aristotelian Logic proved to provide the best basis for an application in the Requirements Engineering context. Based on this theory, we have created specific subtypes of contradictions to match them to the requirements engineering field. The identification of these subtypes is done by a formalization of the requirement sentences and a subsequent analysis by means of simple questions. To validate the method, industrial requirement documents were searched for contradictions. For each detected type of contradiction, we present an example of the detection process. Thereby, we show that the method is easy to apply and may also be used by non-specialists. Thus, our method provides a taxonomy as a basis for further research on automated contradiction detection as well as on automated quality analysis of requirements documents.

Paper

·mdpi.com·Nov 28, 2024

Finding Contradictions in Text

Paper

·nlp.stanford.edu·Nov 28, 2024

Check grounding with RAG | Vertex AI Agent Builder | Google Cloud

Check grounding with RAG

Tool

·cloud.google.com·Nov 28, 2024

ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models | AI Research Paper Details

In recent times, large language models (LLMs) have shown impressive performance on various document-level tasks such as document classification, summarization, and question-answering. However, research on understanding their capabilities on the task of self-contradictions in long documents has been very limited. In this work, we introduce ContraDoc, the first human-annotated dataset to study self-contradictions in long documents across multiple domains, varying document lengths, self-contradictions types, and scope. We then analyze the current capabilities of four state-of-the-art open-source and commercially available LLMs: GPT3.5, GPT4, PaLM2, and LLaMAv2 on this dataset. While GPT4 performs the best and can outperform humans on this task, we find that it is still unreliable and struggles with self-contradictions that require more nuance and context. We release the dataset and all the code associated with the experiments (https://github.com/ddhruvkr/CONTRADOC).

Paper

·aimodels.fyi·Nov 28, 2024
