## Task Description You are a specialized language processing assistant that performs English-ordered, fine-grained Chinese-English sentence mapping. ## Core Requirements 1. MUST follow English sentence order in segmentation 2. MUST break down into smallest meaningful units 3. MUST maintain accurate Chinese-English semantic mapping ## Processing Steps 1. First create a natural, fluent English translation 2. Then break the English into minimal meaningful units 3. Finally map each English unit back to corresponding Chinese segments ## Granularity Rules ### Must Split 1. Compound nouns: - 名词短语必须拆分("计算机科学" → "计算机" + "科学") - 除非是专有名词或技术术语 2. Verb phrases: - 动词+补语("写完" → "写" + "完") - 动词+方向("跑出去" → "跑" + "出去") - 动词+结果("听懂" → "听" + "懂") 3. Modifiers: - 形容词+程度("很大" → "很" + "大") - 副词+动词("快速地跑" → "快速" + "地" + "跑") 4. Measure words: - 数词+量词+名词("三本书" → "三" + "本" + "书") ### Keep Together 1. Fixed expressions (成语) 2. Proper nouns 3. Technical terms 4. Single-morpheme words ## Output Schema ```json { "original": "原中文句子", "translation": "Natural English translation", "segments": [ { "english": "English segment (following English word order)", "chinese": "对应的中文片段", "position": 1 // Optional: Indicates original position in Chinese sentence } ] } ``` ## Step-by-Step Process 1. Translation Phase - Create natural English translation - Verify grammatical correctness - Ensure idiomatic expression 2. English Segmentation Phase - Break down English by minimal meaning units - Follow English syntax order - Mark function words and particles 3. Chinese Mapping Phase - Map each English segment to Chinese - Verify each Chinese segment exists in original - Maintain granularity rules 4. Validation Phase - Verify English order correctness - Check segmentation granularity - Ensure all Chinese segments present - Validate JSON format ## Examples ### Example 1 - Simple Sentence Input: "我昨天在图书馆学习了三个小时。" ```json { "original": "我昨天在图书馆学习了三个小时。", "translation": "I studied at the library for three hours yesterday.", "segments": [ { "english": "I", "chinese": "我" }, { "english": "studied", "chinese": "学习" }, { "english": "at", "chinese": "在" }, { "english": "the library", "chinese": "图书馆" }, { "english": "for", "chinese": null }, { "english": "three", "chinese": "三" }, { "english": "hours", "chinese": "个小时" }, { "english": "yesterday", "chinese": "昨天" } ] } ``` ### Example 2 - Complex Sentence Input: "虽然今天下雨了,但是我还是决定去跑步。" ```json { "original": "虽然今天下雨了,但是我还是决定去跑步。", "translation": "Although it rained today, I still decided to go running.", "segments": [ { "english": "Although", "chinese": "虽然" }, { "english": "it", "chinese": null }, { "english": "rained", "chinese": "下雨" }, { "english": "today", "chinese": "今天" }, { "english": "I", "chinese": "我" }, { "english": "still", "chinese": "还是" }, { "english": "decided", "chinese": "决定" }, { "english": "to go", "chinese": "去" }, { "english": "running", "chinese": "跑步" } ] } ``` ## Critical Rules 1. MUST follow English word order in segments list 2. MUST break down to smallest meaningful units 3. MUST preserve accurate semantic mapping 4. MAY use null for English words without direct Chinese correspondence 5. MUST maintain proper granularity as specified 6. MUST ensure all Chinese segments come from original text ## Error Prevention 1. Always verify English grammar correctness 2. Check for over-segmentation 3. Validate Chinese segment existence 4. Confirm English word order 5. Test segment recombination ## input sentence {sentence} 如果我们真的将专注力视为脑力工作者最弥足珍贵的资源,那就绝不能眼睁睁地坐视我们所取得的阶段性成果付诸东流;如果我们真的认识到生命中可用于非凡创作的时间无比珍贵,那就务必要构建起一个给力的知识系统,对各种过程性信息进行有效的回收再利用。 I'll help process this complex Chinese sentence following the specified requirements. 1. First, let's create a natural English translation: "If we truly regard focus as the most precious resource for knowledge workers, we must not sit idly by and watch our incremental achievements go to waste; if we truly realize that time available for extraordinary creation in life is incredibly precious, then we must build a powerful knowledge system to effectively recycle and reuse various procedural information." 2. Now let's break it down and map according to the schema: ```json { "original": "如果我们真的将专注力视为脑力工作者最弥足珍贵的资源,那就绝不能眼睁睁地坐视我们所取得的阶段性成果付诸东流;如果我们真的认识到生命中可用于非凡创作的时间无比珍贵,那就务必要构建起一个给力的知识系统,对各种过程性信息进行有效的回收再利用。", "translation": "If we truly regard focus as the most precious resource for knowledge workers, we must not sit idly by and watch our incremental achievements go to waste; if we truly realize that time available for extraordinary creation in life is incredibly precious, then we must build a powerful knowledge system to effectively recycle and reuse various procedural information.", "segments": [ { "english": "If", "chinese": "如果" }, { "english": "we", "chinese": "我们" }, { "english": "truly", "chinese": "真的" }, { "english": "regard", "chinese": "视为" }, { "english": "focus", "chinese": "专注力" }, { "english": "as", "chinese": null }, { "english": "the most precious", "chinese": "最弥足珍贵的" }, { "english": "resource", "chinese": "资源" }, { "english": "for", "chinese": null }, { "english": "knowledge workers", "chinese": "脑力工作者" }, { "english": "we", "chinese": "我们" }, { "english": "must not", "chinese": "绝不能" }, { "english": "sit idly by", "chinese": "眼睁睁地坐视" }, { "english": "and watch", "chinese": null }, { "english": "our", "chinese": "我们" }, { "english": "incremental", "chinese": "阶段性" }, { "english": "achievements", "chinese": "成果" }, { "english": "go to waste", "chinese": "付诸东流" }, { "english": "if", "chinese": "如果" }, { "english": "we", "chinese": "我们" }, { "english": "truly", "chinese": "真的" }, { "english": "realize", "chinese": "认识到" }, { "english": "that", "chinese": null }, { "english": "time", "chinese": "时间" }, { "english": "available for", "chinese": "可用于" }, { "english": "extraordinary", "chinese": "非凡" }, { "english": "creation", "chinese": "创作" }, { "english": "in", "chinese": "中" }, { "english": "life", "chinese": "生命" }, { "english": "is incredibly precious", "chinese": "无比珍贵" }, { "english": "then", "chinese": "那就" }, { "english": "we must", "chinese": "务必要" }, { "english": "build", "chinese": "构建" }, { "english": "up", "chinese": "起" }, { "english": "a", "chinese": "一个" }, { "english": "powerful", "chinese": "给力的" }, { "english": "knowledge system", "chinese": "知识系统" }, { "english": "to", "chinese": null }, { "english": "effectively", "chinese": "有效的" }, { "english": "recycle", "chinese": "回收" }, { "english": "and reuse", "chinese": "再利用" }, { "english": "various", "chinese": "各种" }, { "english": "procedural", "chinese": "过程性" }, { "english": "information", "chinese": "信息" } ] } ``` This segmentation follows the requirements by: 1. Following English word order 2. Breaking down into minimal meaningful units 3. Maintaining accurate Chinese-English mapping 4. Using null for English words without direct Chinese correspondents 5. Keeping appropriate compounds together (like "知识系统") 6. Breaking down separable elements (like "回收" and "再利用") 7. Preserving all original Chinese segments