Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
7.4
TFLOPS
3
34
16
Simon Pagezy
pagezyhf
Follow
reach-vb's profile picture
Akash20000's profile picture
davanstrien's profile picture
15 followers
·
24 following
pagezyhf
AI & ML interests
Healthcare ML
Recent Activity
replied
to
singhsidhukuldeep
's
post
1 day ago
Excited to share insights from Walmart's groundbreaking semantic search system that revolutionizes e-commerce product discovery! The team at Walmart Global Technology(the team that I am a part of 😬) has developed a hybrid retrieval system that combines traditional inverted index search with neural embedding-based search to tackle the challenging problem of tail queries in e-commerce. Key Technical Highlights: • The system uses a two-tower BERT architecture where one tower processes queries and another processes product information, generating dense vector representations for semantic matching. • Product information is enriched by combining titles with key attributes like category, brand, color, and gender using special prefix tokens to help the model distinguish different attribute types. • The neural model leverages DistilBERT with 6 layers and projects the 768-dimensional embeddings down to 256 dimensions using a linear layer, achieving optimal performance while reducing storage and computation costs. • To improve model training, they implemented innovative negative sampling techniques combining product category matching and token overlap filtering to identify challenging negative examples. Production Implementation Details: • The system uses a managed ANN (Approximate Nearest Neighbor) service to enable fast retrieval, achieving 99% recall@20 with just 13ms latency. • Query embeddings are cached with preset TTL (Time-To-Live) to reduce latency and costs in production. • The model is exported to ONNX format and served in Java, with custom optimizations like fixed input shapes and GPU acceleration using NVIDIA T4 processors. Results: The system showed significant improvements in both offline metrics and live experiments, with: - +2.84% improvement in NDCG@10 for human evaluation - +0.54% lift in Add-to-Cart rates in live A/B testing This is a fantastic example of how modern NLP techniques can be successfully deployed at scale to solve real-world e-
reacted
to
singhsidhukuldeep
's
post
with 🤯
1 day ago
Excited to share insights from Walmart's groundbreaking semantic search system that revolutionizes e-commerce product discovery! The team at Walmart Global Technology(the team that I am a part of 😬) has developed a hybrid retrieval system that combines traditional inverted index search with neural embedding-based search to tackle the challenging problem of tail queries in e-commerce. Key Technical Highlights: • The system uses a two-tower BERT architecture where one tower processes queries and another processes product information, generating dense vector representations for semantic matching. • Product information is enriched by combining titles with key attributes like category, brand, color, and gender using special prefix tokens to help the model distinguish different attribute types. • The neural model leverages DistilBERT with 6 layers and projects the 768-dimensional embeddings down to 256 dimensions using a linear layer, achieving optimal performance while reducing storage and computation costs. • To improve model training, they implemented innovative negative sampling techniques combining product category matching and token overlap filtering to identify challenging negative examples. Production Implementation Details: • The system uses a managed ANN (Approximate Nearest Neighbor) service to enable fast retrieval, achieving 99% recall@20 with just 13ms latency. • Query embeddings are cached with preset TTL (Time-To-Live) to reduce latency and costs in production. • The model is exported to ONNX format and served in Java, with custom optimizations like fixed input shapes and GPU acceleration using NVIDIA T4 processors. Results: The system showed significant improvements in both offline metrics and live experiments, with: - +2.84% improvement in NDCG@10 for human evaluation - +0.54% lift in Add-to-Cart rates in live A/B testing This is a fantastic example of how modern NLP techniques can be successfully deployed at scale to solve real-world e-
liked
a model
1 day ago
Qwen/QVQ-72B-Preview
View all activity
Articles
Hugging Face models in Amazon Bedrock
26 days ago
•
10
Introducing HUGS - Scale your AI with Open Models
Oct 23, 2024
•
36
Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI
Aug 19, 2024
•
18
Google Cloud TPUs made available to Hugging Face users
Jul 9, 2024
•
19
Introducing Spaces Dev Mode for a seamless developer experience
May 21, 2024
•
14
Organizations
pagezyhf
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
Datou1111/shou_xin
15 days ago
Add generated example
#9 opened 15 days ago by
pagezyhf
New activity in
huggingface/HuggingDiscussions
2 months ago
[FEEDBACK] Follow
4
#14 opened about 1 year ago by
victor
New activity in
aws-neuron/optimum-neuron-cache
8 months ago
[Cache Request] meta-llama/Meta-Llama-3-8B
1
#71 opened 8 months ago by
sandkoan