-
Notifications
You must be signed in to change notification settings - Fork 81
Closed
Description
async with self.async_session() as session:
statement = select(Features).where(
Features.stream_id == stream_id,
Features.date == today,
Features.feature_vector.is_not(None)
)
result = await session.execute(statement)
features = result.scalars().all()
print(features[0])
duplicates = []
for i, f1 in enumerate(features):
for f2 in features[i+1:]:
similarity = f1.feature_vector.cosine_distance(f2.feature_vector)
cosine_similarity = 1 - similarity
if cosine_similarity >= threshold:
duplicates.append({
"file1": f1.file_path,
"file2": f2.file_path,
"distance": float(cosine_similarity)
})
insert embedding type is python LIST
when get a feature
date=datetime.date(2025, 7, 23) stream_id='1426b25e-f561-4a80-9b5c-11da26a873e1' id=17 file_path='221540.png' feature_vector=array([ 0.05307889, -0.00182588, 0.04429955, 0.03629855, -0.02438672,
-0.00863222, 0.02422031, 0.06754512, 0.06605434, 0.00082781,
..........
0.03596665, 0.01975283, 0.03476083, 0.03682475, 0.00470952,
-0.07983077, 0.00216867], dtype=float32) created=datetime.datetime(2025, 7, 23, 11, 29, 4, 713193)
why get this error?
Metadata
Metadata
Assignees
Labels
No labels