Unicode कन्वर्टर

Unicode कन्वर्टर क्या है?

यह टूल text को Unicode code points में convert करता है और वापस। UTF-8/UTF-16 bytes inspect कर सकते हैं। हिंदी, चीनी, इमोजी सब handle करता है। Encoding issues debug करने के लिए ज़रूरी।

टूल का उपयोग कैसे करें

Text type/paste करें।
Code points (U+XXXX format) तुरंत देखें।
UTF-8 और UTF-16 byte representation भी मिलेगा।
Reverse: code points से text में convert करें।

मुख्य उपयोग

Encoding mismatch debug करना
Emoji के bytes inspect करना
Database में text storage समझना
API integrations में character escaping

अक्सर पूछे जाने वाले प्रश्न

Unicode क्या है?

Unicode दुनिया की हर language, script, emoji के लिए unique numeric code देता है। 1,40,000+ characters — हिंदी, चीनी, अरबी, इमोजी, गणितीय symbols सब। UTF-8 इसका encoding है — internet पर 98% इस्तेमाल होता है।

UTF-8 और UTF-16 में क्या फर्क है?

UTF-8: variable-length (1-4 bytes), ASCII compatible — web standard। UTF-16: 2 या 4 bytes — Windows internal, Java, JavaScript के स्ट्रिंग्स। UTF-32: fixed 4 bytes — कम popular। Web पर UTF-8 हमेशा prefer करें।

Code point क्या होता है?

हर Unicode character का unique number। उदाहरण: 'A' = U+0041, 'अ' = U+0905, '😀' = U+1F600। Hex में लिखे जाते हैं U+ prefix के साथ। यह टूल character ↔ code point ↔ UTF-8 bytes सब convert करता है।

Surrogate pairs क्या हैं?

BMP (Basic Multilingual Plane, U+0000 to U+FFFF) में 65,536 characters। उससे ऊपर के characters (इमोजी, ancient scripts) UTF-16 में 2 surrogate pairs से represent होते हैं। JavaScript strings UTF-16 में हैं — emoji की length 2 आती है।