An Annotated Chinese Health Question Corpus
|
本标注规则应与本网站提供的“公众健康问句分类体系”配套使用。具体规则如下:
1. 由于部分问句来源网站提供了一个提问模板来引导公众输入问题。因此,如果提问消息中有“想得到怎样的帮助”等提示句,则以该短语后的第一个问题为主要分类;后续各问题为次要分类。如果没有“想得到怎样的帮助”等提示句,则以整个提问消息中的第一个问题为主要分类,后续各问题为次要分类。
2. 对于一个“问句”,每个类别最多标注一次。当一个“问句”里有多个“子问句”时,如果这些子问句同属于一个类别,则仅标注一个代码;如果这些子问句不属于同一个类别时,则需要将所需的各类别分别标注,并将第一个子问题的类别标注为主要代码,后续各子问题为次要代码。
3. 对于没有提问语句/提问词的问题:①仅提供临床表现或病情,诊断未知,归为A01(诊断→病因/临床发现的解释)和B06(治疗→不限于但可能包含药物治疗);②提供一个或多个疾病名称视为诊断已知,未提及治疗措施或提到的治疗措施不只包括药物治疗,归为B06(治疗→不限于但可能包含药物治疗);③诊断已知,且仅提及药物治疗,归为B02(治疗→药物选择/适应症/效力);④诊断已知,提及用药一段时间后恢复正常,归为B01(治疗→药物用法用量),意指是否可以停药了。
4. A01(诊断→病因/临床发现的解释): 提问者提及一系列临床发现想知道它是由什么情况引起的。即知道临床发现,但不知道是什么情况。注意提到没有某临床发现,也属于一种临床发现。临床发现包括症状、体征、检验检查发现等。
4.1症状指疾病过程中机体内的一系列机能、代谢和形态结构异常变化所引起的病人主观上的异常感觉称为症状,如疼痛,不适,畏寒等。
4.2体征指能用体格检查的方法检出的异常变化所引起的现象,如心脏杂音,肺部罗音,血压(升)高,反射异常等。
4.3临床检验(检查)是将病人的血液、体液、分泌物、排泄物和脱落物等标本,通过目视观察、物理、化学、仪器或分子生物学方法等进行检测,从而为临床、为病人提供有价值的实验资料。
5. A02(诊断→疾病诊断标准/疾病的临床表现): 提问者从某种情况开始,想知道临床发现X1,X2,…Xn是否是该情况/疾病的临床表现,或者想知道根据临床发现X1,X2…,是否能确诊或排除该情况/疾病。
6. A03(诊断→检验检查): 提问者主要关注做某项检查的适应症、精确度、时间选择、方法等,而不是对检验检查结果的解释(参见A01)。检验检查包括体格检查、心电图、皮试、活组织切片检查、B超、X光、CT、MRI等。
7. A04(诊断→疾病/情况介绍): 提问者知道该疾病/情况的名称,但不知道它是什么。该类不用于任何分析性问题。
8. A99(诊断→其他有关诊断的问题):如诊断相关费用等。
9. B01(治疗→药物用法用量): 提问者知道应该用什么药,但是不知道其用法用量及用药时间等。
10. B02(治疗→药物选择/适应症/效力):包括治疗性用药选择和预防性用药选择。
11. 对于诸如“请问用什么方法来调理?”之类的问题,如果提到了药物类,别,则归为B02(治疗→药物选择/适应症/效力);如果没有提到药物,则归为E99(健康生活方式→未特指的)。
12. B03(治疗→药物副作用/不良反应): 主要用于药物X副作用不明确,问“有副作用吗?会引起情况Y吗?情况Y跟它有关系吗?”等问题。
13. B04(治疗→用药禁忌/注意事项):用于药物X副作用不明确,问“对生育、母乳喂养等有影响吗?某情况能用药物X吗?”等的问题。以及用于已知药物X有副作用,问“还能吃吗?能停吗?怎么降低副作用?怎么解决依赖性?”等的问题。
14. B05(治疗→药物相互作用):用于询问所提及的两个及以上药物之间是否会有相互作用的问题?以及某个药物是否会和其他未特指的某个药物发生相互作用的问题。
15. B06(治疗→不限于但可能包含药物治疗):提问者考虑的是一般治疗而不仅是药物,即当问题不是特指药物治疗时归为该类别。对于“能不能治愈?”“有没有治?”“能不能治好?”等问题,同时标注B06(治疗→不限于但可能包含药物治疗)和D03(流行病学→进程/预后/并发症/后遗症)。
16. B99(治疗→有关治疗的其他问题):包括药理作用,药物生产厂家,药物医保性质,药物名称查找,治疗费用及治疗前准备等问题。
17. C01(解剖学/生理学→人体组织/器官/系统):用于询问有关人体的组织、器官及系统的问题。
18. C02(解剖学/生理学→人体代谢):用于询问有关人体的新陈代谢的问题。
19. C99(解剖学/生理学→有关解剖学/生理学的其他问题):用于其他有关人体解剖学和生理学的问题。
20. D01(流行病学→患病率/发病率):提问者关心有没有类似的病人,以及该疾病的患病率/发病率等情况。
21. D02(流行病学→病因学/病原学):该类的问题是询问危险因素和一种病情(在病情发生之前出现的危险因素)之间的相关性,或者(同时出现的)2种或多种病情之间的相关性,以及疾病的遗传性等。如果一个元素不是疾病进程中的一部分,而是该病情(疾病)的一个危险因素,则用D02。
22. 如果两个元素都是同一疾病进程中的一部分(参见D03),则不用D02(流行病学→病因学/病原学)。
23. D02(流行病学→病因学/病原学)不用于药物副作用(参见B03)。
24. 问题为“为什么会出现这样的状况”、“为什么会这样”时,若已知诊断,则归为D02(流行病学→病因学/病原学);若不知诊断,则归为A01(诊断→病因/临床发现的解释)。
25. D03(流行病学→进程/预后/并发症/后遗症):该类询问病情会随着时间发生什么变化。包括单纯的预后问题,以及两个先后发生的病情之间的联系(参见D02)。
26. 诊断已知,问这严重吗/危险性如何等,归为D03(流行病学→进程/预后/并发症/后遗症)。
27. 诊断未知,列出一系列临床表现,问这严重吗,归为A01(诊断—对临床发现的解释)。
28. E(健康生活方式):包括饮食、运动、减肥、压力/情绪管理等。E01(健康生活方式→饮食):主要用于“吃什么食物/营养品好?”、“什么营养品有某某功效吗?”、“食物/营养品X对情况Y好吗?”等问题。以及“我能/能否/可以吃食物X吗?”等问题。
29. 有关营养品的问题,一般归为E01(饮食),但作为药用的营养品/食物也可归为B02(药物选择/适应症/效力)。
30. F01(择医→医疗机构选择):医院选择/推荐。
31. F02(择医→医疗科室选择):就诊科室选择/推荐。
32. F03:(择医→医生选择):医生选择/推荐。
33. F99(择医→其他):关于预约挂号、就医流程、乘车路线,以及虚拟社区、健康网站推荐等问题归为此类。
34. Z99(其他):用于不能归类到A~F中的任何一个类别的问题。
The annotation rules should be used together with the Classification Schema of Consumer Health Questions provided in the website. Details were as following:
1. Since some websites provide a template for users to generate question, therefore, if there are phrases such as “what kinds of help do you want (想得到怎样的帮助)” in the message, then take the sub-sentence after the phrase as the “question”. Otherwise, take the whole message as the “question”.
2. Each code can be annotated only once for each question. That is to say, when a question contains several sub-questions, if these sub-questions belong to the same topic category, then annotate only one code; if the sub-questions belong to different categories, then annotate the topic of the first sub-question as the major code, and the topics of the following questions as the minor codes.
3. For those questions without interrogative words: a. if only clinical findings or conditions were mentioned, and the diagnosis was unknown, then annotate as A01 (diagnosis → interpretation of clinical finding) and B06 (treatment → not limited to but may include drug therapy); b. if one or more disease names were mentioned then regard as the diagnosis was known, and if the therapeutic measures were not mentioned or the mentioned treatment was not limited to drug therapy, then annotate B06 (treatment → not limited to but may include drug therapy); c. if the diagnosis was already known and only the drug therapy was mentioned, then annotate B02(Treatment → drug choice/indications /efficiency); d. if the diagnosis was already known and mentioned that the patient recovered after a period of drug using, then annotate B01 (treatment →how to use drug), the main concern was if the patient can stop using the drugs.
4. A01 (diagnosis → interpretation of clinical finding): The question start with a series of clinical findings and want to know what condition is causing them. The questioner know what the finding is, don’t know what the condition is. It is also a clinical finding if there is a mention of not having the clinical finding. The clinical findings include symptom, sign, test findings and so on.
4.1 Symptoms refer to the patient's subjective abnormal sensation caused by a series of functional, metabolic and morphological abnormal changes, such as pain, discomfort, chills, etc.
4.2 Signs refer to the phenomenon caused by abnormal changes detected by means of physical examination, such as heart murmur, lung rale, high blood pressure (rise), abnormal reflection, etc.
4.3 Clinical test is to examine the specimens of patient's blood, body fluids, secretions, excretions and shedding, by observation, physical, chemical, instrument or molecular biology methods, to provide valuable experimental data for clinical treatment.
5. A02 (diagnosis → criteria/ manifestations): The questioner start with a condition and want to know if clinical findings x1, x2, . . ,xn could be manifestations of that condition. They know what the condition is, but don’t know if findings x1, x2, . . . , xn could be manifestations of that condition.
6. A03 (diagnosis → test): The main concern of the question was the indications for doing a test, and accuracy, timing, methods of the test. Clinical test include body test, library test, electrocardiography, skin test, tissue section test, B-scan ultrasonography, X-ray, CT, MRI, and so on.
7. A04 (diagnosis → orientation of disease & condition): The questioner only knows the name of the disease or situation, but does not know what it is exactly. This category does not apply to any analytical questions.
8. A99 (diagnosis → other): Used for questions about fees of diagnosis and so on.
9. B01 (treatment → how to use drug): The asker knows what drug to use, but don’t know how to use it, such as its dose and timing.
10. B02 (treatment → drug choice/ indications/ efficiency): This category includes therapeutic and preventive drug choice.
11. Questions such as "What can I do to get recovery?" It can be categorized as either B02 (if the drug category is mentioned) or E99 (if no drugs are mentioned).
12. B03 (treatment → adverse effects of drug): It is mainly used for questions that the asker was not sure about the adverse effects of a giving drug and asking “Whether the drug have adverse effects/will cause condition Y? Whether condition Y have any relation with the drug? …”
13. B04 (treatment →contraindications of drug/ matters need attention): It is mainly used for questions that the asker was not sure about the adverse effects of a giving drug and asking “Whether it have any effects on pregnancy/breast feeding/…? Whether a giving condition can use this drug? …” As well as used for questions that the asker have already known the adverse effects of a giving drug and asking “Can I eat this drug? Can I stop using it? How to reduce the side effects? How to get rid of the dependence? …”
14. B05 (treatment → drug interactions): Used for questions that ask about if there are will be interactions between two or more kinds of drug, and if a giving drug will has interactions with some none- mentioned drugs.
15. B06 (treatment → not limited to but may include drug therapy): If the treatment considered was not limited to drug therapy, then annotate as the subordinate class of B02. For questions such as “Is it curable?” annotate both B06 (treatment → not limited to but may include drug therapy) and D03 (epidemiology → course/ prognosis/sequelae).
16. B99 (treatment → other): This category mainly used for questions that ask about pharmacological action, manufacture, and name of the drug, fees of treatment, preparation before treatment, and so on.
17. C01 (anatomy/physiology → tissue/ organ/ system): Used for questions ask about the tissue, organ, and system of human body.
18. C02 (anatomy/physiology → metabolism): Used for questions ask about the metabolism of human body.
19. C99 (anatomy/physiology → other): Used for other issues about the anatomy or physiology of human body.
20. D01 (epidemiology →prevalence/ incidence): The asker want to know if there are patients like them or their family member, and the prevalence or incidence of the disease.
21. D02 (epidemiology → etiology/ causation): This category asks about associations between the risk factors and a condition (the risk factor occurring before the condition) or the relations between two or more conditions that are present at the same time. Use this code when one element, which is not part of the disease process but a risk factor for the condition/disease.
22. Do not use D02 (epidemiology → etiology/ causation) when both elements are part of the same disease progression (see D03).
23. Do not use D02 (epidemiology → etiology/ causation) for adverse drug reactions (See B03).
24. If the question was asking “Why should this happen? …”, and if diagnosis was already known, then annotate D02 (epidemiology → etiology/ causation), if the diagnosis was unknown, then annotate A01 (diagnosis → interpretation of clinical finding).
25. D03 (epidemiology → course/ prognosis/ sequelae): This category asks what will happen to a patient over time. It includes plain prognosis questions as well as associations between two conditions, where one condition occurs before the other (See D02).
26. If the question involve the dangerousness/severity of a disease/condition and the diagnosis is known, then annotate D03 (epidemiology → course/ prognosis/ sequelae).
27. If the question lists a series of clinical manifestations and the diagnosis is unknown, then annotate A01 (diagnosis → interpretation of clinical finding).
28. E (healthy lifestyle): Include diet, exercise, weight loosing, stress and mood management, and so on. For this category: a. E01 (healthy lifestyle → diet) mainly used for questions such as “What kind of food is good for a giving condition? If a mentioned food have a mentioned specific effect on a giving condition? If a mentioned food/nutrition is good for a giving condition? ” as well as questions about if a patient with a giving disease/condition can eat the mentioned food.
29. Questions about nutrition are usually classified as the subordinate class of E01 (healthy lifestyle → diet). Nutrients or food carry medical efficacy can also be classified as the subordinate class of B01 (treatment→drug choice/ indications/ efficiency).
30. F01 (health provider choice →hospital): Mainly ask for a recommendation of hospital to handle a giving disease or condition.
31. F02 (health provider choice → medical department): Mainly ask for a recommendation of medical department to handle a giving disease or condition.
32. F03 (health provider choice → doctor): Mainly ask for a recommendation of doctor to handle a giving disease or condition.
33. F99 (health provider choice → other): Include appointment, the process of seeing a doctor, and the transportation route, as well as virtual community and health website recommendation etc..
34. Z99 (other):Used for questions that unable to be classified to any categories under A~F.