Skip to main content

M4 Series: Semantic Transfer & Universal Meaning

Overview

The M4 series investigates whether HDC representations capture universal meaning that transcends languages and preserves semantic structure through extreme compression.

Key findings:

  • Meaning is language-agnostic (91.3% cross-lingual transfer)
  • Ternary quantization improves semantic arithmetic (110% retention)
  • HDC competitive with Knowledge Distillation (98.4%) with unique properties

M4c: Cross-Lingual Transfer

Hypothesis

If HDC captures meaning rather than surface patterns, representations learned from one language should transfer to others without retraining.

Setup

  • Dataset: XNLI (Cross-lingual Natural Language Inference)
  • Training: English only (10,000 examples)
  • Testing: 10 languages (500 examples each)
  • Encoder: paraphrase-multilingual-mpnet-base-v2
  • HDC: 16384d ternary, Two-Vector approach

Results

LanguageAccuracyTransfer Ratio
English (train)64.8%baseline
Spanish62.8%96.9%
German61.6%95.1%
French60.6%93.5%
Bulgarian59.6%92.0%
Chinese59.4%91.7%
Vietnamese59.2%91.4%
Russian57.8%89.2%
Arabic56.6%87.3%
Hindi54.8%84.6%
Average59.2%91.3%

Conclusion

HDC representations trained on English achieve 91.3% of their performance when tested on typologically diverse languages including Chinese, Arabic, and Hindi. Meaning is universal.


M4d: Semantic Compositionality

Hypothesis

If HDC preserves semantic structure, vector arithmetic should produce meaningful results (king - man + woman = queen).

Setup

  • Task: 12 word analogies (classic word2vec set)
  • Vocabulary: 71 words with distractors
  • Comparison: Original embeddings → Float HDC → Ternary HDC

Results

MethodTop-1Top-5
Original embeddings (768d)67%83%
Float HDC (4096d)67%83%
Ternary HDC (4096d)75%92%

Retention rate: 110% — Ternary is better than original.

Working Analogies

  • king - man + woman = queen ✅
  • paris - france + germany = berlin ✅
  • tokyo - japan + france = paris ✅
  • walked - walk + swim = swam ✅
  • bigger - big + small = smaller ✅

Conclusion

Ternary quantization acts as regularization, removing noise and strengthening semantic signal. HDC captures genuine meaning, not patterns.


M4e: HDC vs Knowledge Distillation

Hypothesis

HDC transfer should be competitive with standard Knowledge Distillation while providing unique properties.

Setup

  • Task: SST-2 Sentiment Classification
  • Teacher: all-mpnet-base-v2 + classifier
  • Standard KD: Small NN (64 hidden) trained on soft labels
  • HDC Transfer: 4096d ternary + classifier

Results

MethodAccuracyCross-LingualArithmetic
Teacher89.0%
Standard KD88.6%NoNo
Tiny KD88.3%NoNo
HDC Transfer87.3%91%110%

HDC vs KD: 98.4%

Conclusion

HDC achieves 98.4% of KD accuracy while providing:

  • Cross-lingual transfer (91.3%)
  • Semantic arithmetic (110%)
  • 32× compression (ternary vs float32)
  • Edge deployment capability

KD compresses the model. HDC transfers the meaning.


Implications for SEP

These experiments validate core SEP claims:

  1. Semantic events can be understood universally — cross-lingual transfer proves meaning transcends language
  2. Meaning survives extreme compression — ternary quantization preserves (and improves) semantic structure
  3. HDC is competitive with standard approaches — no accuracy sacrifice for unique properties

Reproducibility

All experiments available at: https://github.com/nick-yudin/SEP/tree/main/experiments

  • m4c_crosslingual/ — Cross-lingual experiment
  • m4d_compositionality/ — Semantic arithmetic
  • m4e_hdc_vs_kd/ — KD comparison