🍋
Menu
Text

Stemming

Word Stemming

Reducing words to their root form by removing suffixes (e.g. 'running' to 'run') for text analysis.

รายละเอียดทางเทคนิค

Stemming operates on sequences of Unicode code points, where each character's properties (category, script, case, directionality) are defined by the Unicode standard. Text processing in the browser uses the TextEncoder/TextDecoder APIs for encoding conversion and Intl.Segmenter for locale-aware word and sentence boundary detection. Understanding the distinction between bytes, code units, code points, and grapheme clusters is essential for correct text manipulation.

ตัวอย่าง

```javascript
// Stemming: text processing example
const input = 'Sample text for processing';
const result = input
  .trim()
  .split(/\s+/)
  .filter(Boolean);
console.log(result); // ['Sample', 'text', 'for', 'processing']
```

เครื่องมือที่เกี่ยวข้อง

คำศัพท์ที่เกี่ยวข้อง