SNOMED CT
Systematized Nomenclature of Medicine - Clinical Term的縮寫,全稱是醫(yī)學(xué)系統(tǒng)化命名-臨床術(shù)語
SNOMED CT(Systematized Nomenclature of Medicine -- Clinical Terms,醫(yī)學(xué)系統(tǒng)命名法-臨床術(shù)語,醫(yī)學(xué)術(shù)語系統(tǒng)命名法-臨床術(shù)語),是一部經(jīng)過系統(tǒng)組織編排的,便于計(jì)算機(jī)處理的醫(yī)學(xué)術(shù)語集,涵蓋大多數(shù)方面的臨床信息,如疾病、所見、操作、微生物、藥物等。采用該術(shù)語集,可以協(xié)調(diào)一致地在不同的學(xué)科、專業(yè)和照護(hù)地點(diǎn)之間實(shí)現(xiàn)對于臨床數(shù)據(jù)的標(biāo)引、存儲(chǔ)、檢索和聚合。同時(shí),它還有助于組織病歷內(nèi)容,減少臨床照護(hù)和科學(xué)研究工作中數(shù)據(jù)采集、編碼及使用方式的變異。
例如,對于心臟病學(xué)專科醫(yī)師來說,心臟病發(fā)作、心肌梗死以及MI可能指的是同一含義,而對于計(jì)算機(jī)來說,三者之間則全然不同。因此,不同的醫(yī)療保健服務(wù)提供者、醫(yī)療服務(wù)設(shè)施、研究人員以及其他相關(guān)方之間需要協(xié)調(diào)一致地交換臨床信息(語義互操作性);而且,不同的地方對于醫(yī)學(xué)信息的記錄方式也各不相同(如采用紙質(zhì)記錄或者采用電子記錄),因而需要一種廣泛全面而又協(xié)調(diào)統(tǒng)一的醫(yī)學(xué)術(shù)語系統(tǒng)(medical terminology system),作為信息基礎(chǔ)結(jié)構(gòu)(information infrastructure)的組成部分。
SNOMED CT目前包括大約321 900條概念(Concept)、超過80萬條臨床概念相關(guān)的描述(Descriptions),和超過700萬條進(jìn)一步描述概念的關(guān)系(Relationships)。
概念 Concept
描述 Description
關(guān)系 Relationship
設(shè)計(jì)體系
概念和描述:每個(gè)唯一性數(shù)字型代碼、唯一性名稱(全稱,即Fully Specified Name)和描述(包括一條首選術(shù)語和一條或多條同義詞)所指定的基本含義單位。 SNOMED CT不再使用詞條表的方式對術(shù)語進(jìn)行表示,而是采用概念的形式。概念以理解為醫(yī)學(xué)中標(biāo)準(zhǔn)的臨床術(shù)語,每個(gè)概念都有唯一的概念碼,但每一個(gè)概念都可能有多個(gè)描述,并且由993420條描述形成了龐大的描述表——我們可以理解成同義詞表。如“Pain in throat”(咽喉痛),在SNOMED CT中是概念,而在實(shí)際應(yīng)用中,它將會(huì)有多種不同的術(shù)語表達(dá),如“Sore throat”、“Throat pain”、“Pain in pharynx”、“Throat discomfort”、“Pharyngeal pain”、“Throat soreness”,但它們并不是概念,而只作為描述被收集在描述表中。每一條概念有若干描述與之對應(yīng),描述表中的每一條描述也有與之相對應(yīng)的概念存在。
SNOMED CT 中共有3種術(shù)語描述類型,即“指定全稱”(Fully Specified Name,F(xiàn)SN)、“首選術(shù)語”(Preferred)及“同義術(shù)語”(Acceptable)。每個(gè)概念都有一個(gè)“指定全稱”和一個(gè)“首選術(shù)語”,同義術(shù)語是除去首選術(shù)語外,其他能夠描述FSN所描述概念的術(shù)語,如圖2所示。
The FSN represents a unique, unambiguous description of a concept's meaning. The FSN is not intended to be displayed in clinical records, but is instead used to disambiguate the distinct meaning of each different concept.
概念描述實(shí)例:
概念 Myocardial infraction 22298006
規(guī)范化全稱:Myocardial infraction(disorder) DescriptionID 751689013
首選術(shù)語:Myocardial infraction DescriptionID 37436014
同義詞:Cardiac infarction DescriptionID 37442013
同義詞:Heart attack DescriptionID 37443015
同義詞:Infarction of heart DescriptionID 37441018
關(guān)系:用于在同一層級結(jié)構(gòu)之內(nèi)或不同層級結(jié)構(gòu)之間將不同的概念聯(lián)系起來。 SNOMED CT中的概念與概念間是有一定“關(guān)系”存在的。概念有36萬條,但關(guān)系有近146萬條。這種基于概念間的語義關(guān)系令數(shù)據(jù)的獲取充分可靠。在SNOMED CT中,關(guān)系分為兩種: IS-A關(guān)系與屬性關(guān)系。
IS-A在同一個(gè)層面中,表示某些概念間的關(guān)系。如關(guān)節(jié)炎屬于關(guān)節(jié)系統(tǒng)疾病,而關(guān)節(jié)系統(tǒng)疾病屬于骨科疾病,這樣關(guān)節(jié)炎→關(guān)節(jié)系統(tǒng)疾病→骨科疾病就形成了一種IS-A關(guān)系;
屬性關(guān)系表示跨層面的概念間的關(guān)系,如“闌尾炎”是一種疾病,但從形態(tài)學(xué)上看,“闌尾炎”屬于炎癥的一種,在屬性關(guān)聯(lián)中,可由“闌尾炎”引導(dǎo)出“炎癥”。
attribute(熟悉)關(guān)系
臨床發(fā)現(xiàn) 手術(shù)/操作 事件 身體結(jié)構(gòu) 藥物/生物 制品、設(shè)備、標(biāo)本等各類概念均有其特有的屬性
臨床發(fā)現(xiàn):發(fā)現(xiàn)的部位、形態(tài)學(xué)表現(xiàn)、與其他(如原因)的關(guān)系、嚴(yán)重度、臨床進(jìn)程等
手術(shù)/操作:手術(shù)部位,手術(shù)目標(biāo)部位,手術(shù)方法,手術(shù)路徑,手術(shù)用藥,優(yōu)先度
由圖我們可以看出,每一個(gè)獨(dú)立的概念會(huì)有若干個(gè)文字化的描述用以表達(dá)。描述可以是同語種中的同義詞,也可以不同語種。概念和概念之間會(huì)有不同的關(guān)系類型來將其鏈接起來。而這些關(guān)系會(huì)不斷的延展下去,最后成為子子孫孫無窮盡也的模式
關(guān)系表記錄了具有臨床意義的全部語義關(guān)聯(lián)組合,運(yùn)用豐富的連接概念,將歸屬于同一概念軸和不同概念軸的概念連接起來。語義關(guān)聯(lián)一方面可以用來組織概念,另一方面也可以構(gòu)成靈活多樣的復(fù)雜概念表達(dá)方式。關(guān)聯(lián)分為兩類:
定義型:定義型關(guān)系是源于概念邏輯定義的一部分
appendicectomy: procedure site = appendix
闌尾切除術(shù):手術(shù)部位 = 闌尾
限定型:限定型關(guān)系提供了一個(gè)可選限定詞,可以應(yīng)用于源概念
appendicectomy: approach = laparascopic
Relationships
SNOMED CT relationships link concepts to other concepts whose meaning is related in some way. These relationships provide formal definitions and other properties of the concept. One type of relationshipis the |is a| relationship which relates a conceptto more general concepts. These |is a| relationships define the hierarchyof SNOMED CT concepts.
- For example, the concepts |bacterial pneumonia| and |viral pneumonia| both have an |is a| relationship to |infective pneumonia| which has an |is a| relationship to the more general concept |pneumonia|.
Other types of relationships represent aspects of the meaning of a concept.
- For example, the concept |viral pneumonia| has a |causative agent| relationship to the concept|virus| and a |finding site|relationshipto the concept |lung|.
Every relationship has a unique numeric relationship identifier.
The set of concepts to which an attribute can be applied is called the 'domain' of the attribute. The permitted set of values for each attribute is called the 'range' of the attribute.
能用某屬性的概念集合叫做domain 屬性的取值集合叫做range
Domain
The domain is the hierarchy to which a specific attribute can be applied.
For example:
The domain of the attribute |associated morphology| is the |clinical finding| hierarchy. Therefore, a |procedure| cannot have an |associated morphology|. However, a |procedure| can have a |procedure morphology|.
Range
The range is the set of SNOMED CT concepts that are allowed as the value of a specified attribute.
For example:
The range for the attribute |associated morphology| is the concept |morphologically abnormal structure| and its subtype descendants.The range for the attribute |finding site| is |anatomical or acquired body structure| and its subtype descendants in the |body structure| hierarchy.
The first example below violates the domain constraint of |causative agent|, as descendants of |body structure| are not in the domain of |causative agent|. The second example below is valid with respect to the domain constraint of |causative agent|, because |disorder| is in the domain of |causative agent|. However, this example violates the range constraint of |causative agent|, as descendants of morphological abnormality are not in the range of |causative agent|.
Expressions
Clinical expressions using SNOMED CT concepts can be of two types: precoordinated expressions, which use a single SNOMED CT concept identifier; and postcoordinated expressions, which contain more than one SNOMED CT identifier. 前組合 后組合
Infective pneumonia 312342009 前組合
病因:細(xì)菌 bacteria 41146007
病理過程:感染 infectious process 441862004 -> 后組合
部位:肺部 lung 39607008
形態(tài)學(xué):炎癥 Inflammation 257552002
Example: Postcoordinated representation of "Laparoscopic emergency appendectomy"
Although SNOMED CT contains the concept |laparoscopic emergency appendectomy|, it is also possible to represent this clinical phrase using the following postcoordinated expression.
-
80146002|appendectomy|:260870009|priority|=25876001|emergency|, 425391005|using access device|=86174004|laparoscope|
This postcoordinated expression has exactly the same meaning as the precoordinated expression
-
174041007|laparoscopic emergency appendectomy|
The fact that the two expressions have the same meaning can be computed because
174041007|laparoscopic emergency appendectomy| is a fully-defined subtype descendant of 80146002|appendectomy| ; and
-
the only differences between the defining attributes of these concepts are the addition of
- 260870009|priority|=25876001|emergency|
- 425391005|using access device| = 86174004|laparoscope|
上面例子說明,及時(shí)有單一概念可以用于表示,同時(shí)也可以用后組合。然而后組合的優(yōu)勢在于,及時(shí)SNOMED CT
中不存在精確概念,它也允許表示臨床短語。
Example: Postcoordinated representation of "Laparoscopic removal of device from abdomen"
SNOMED CT does not contain a concept that represents this clinical idea. However, it is possible to represent it using the following postcoordinated expression.
68526006|removal of device from abdomen|:425391005|using access device|= 6174004|laparoscope|
The basics of SNOMED CT compositional grammar
- At its simplest level a single SNOMED CT concept identifier is a valid expression.
- 80146002
- A concept identifier can optionally be followed by a term associated with that concept enclosed between two pipe characters
- 80146002|appendectomy|
- A concept identifier (with or without a following term) can be followed by a refinement. The refinement follows a colon
- 80146002|appendectomy|:<refinement>
- A refinement consists of a sequence of one or more attribute-value pairs. Both the attribute and the value are represented by a concept identifier (with or without a following term). The attribute is separated from the value by an equals sign
- 80146002|appendectomy|:260870009|priority|=25876001|emergency|
- If there is more than one attribute-value pair, the pairs are separated by commas
- 80146002|appendectomy|:260870009|priority|=25876001|emergency|, 425391005|using access device|=86174004|laparoscope|
- Curly braces represent grouping of attributes within a refinement, for example to indicate that the method applies to a specific site
- 80146002|appendectomy|:{ 260686004|method|=129304002|excision - action|, 405813007|procedure site - direct|= 181255000|entire appendix|}
- Round brackets represent nesting to allow the value of an attribute to be refined
- 161615003|history of surgery|:363589002|associated procedure|=(80146002|appendectomy|:260870009|priority|=25876001|emergency)
編碼
對于每一個(gè)系統(tǒng),其編碼都是有固定格式延續(xù)下來的。SNOMED CT的編碼,我們簡稱為SCTID,為一組不超過18位的數(shù)字。由于歷史原因SCTID存在兩種格式。在SNOMED CT發(fā)布之初,也就是常說的發(fā)布格式1(RF1)或短碼(short format SCTID),其編碼格式為三段式,全部由SNOMED International發(fā)布的。
例子: 70153 00 2
最后一組,一位數(shù),為校驗(yàn)碼。本例為2
倒數(shù)第二組,兩位數(shù),為類別代碼。本例為00
剩余為序列號。本例為70153
在2012年,SNOMED CT的第二個(gè)發(fā)布格式(RF2)開始啟用。其編碼格式變成了四段式,也就是相對而言的長碼(long format SCTID)。這類代碼既可以是SNOMED International發(fā)布的,也可能是由其它授權(quán)機(jī)構(gòu)發(fā)布的擴(kuò)展編碼。
例子: 1227124 1000119 10 9
最后一組,一位數(shù),為校驗(yàn)碼。本例為9
倒數(shù)第二組,兩位數(shù),為類別代碼。本例為10
倒數(shù)第三組,七位數(shù),為機(jī)構(gòu)碼。本例為1000119
剩余為序列號。本例為1227124
校驗(yàn)碼 Check Digit
SNOMED CT的校驗(yàn)碼使用的是Verhoeff算法。該算法相較Modulus 11算法更適合超過10位數(shù)的計(jì)算。使用校驗(yàn)碼有利于提高數(shù)據(jù)錯(cuò)誤的檢驗(yàn)。
類別碼 Partition Identifier
目前使用中的類別碼只有六個(gè):
00 概念(短碼)
01 描述(短碼)
02 關(guān)系(短碼)
10 概念(長碼)
11 描述(長碼)
12 關(guān)系(長碼)
因此當(dāng)SCTID的倒數(shù)第三位數(shù)字是1的話,這個(gè)編碼就是一個(gè)四段式的長碼。如果是0的話,這個(gè)編碼就是一個(gè)三段式的短碼。看碼識(shí)類別是SNOMED CT編碼有別于其它編碼系統(tǒng)的一個(gè)特色。其實(shí)這樣的編碼在應(yīng)用程序的設(shè)計(jì)時(shí)比較方便。
機(jī)構(gòu)代碼 Namespace Identifier
機(jī)構(gòu)代碼是由SNOMED International認(rèn)可的機(jī)構(gòu)代碼,由七位數(shù)字組成。如果不足七位的話使用0從左邊補(bǔ)齊。SNOMED International將最新的機(jī)構(gòu)代碼目錄保存在谷歌服務(wù)器。在上面例子中的機(jī)構(gòu)碼1000119為Kaiser Permanente。
序列號 Extension Item Identifier
序列號就是按照順序排列的數(shù)字,在短碼中理論上可以最多為15位數(shù)字,在長碼中最多為八位數(shù)字。其具體分配方式(順序或隨機(jī))SNOMED International并沒有書面上的規(guī)定,所以可以由編碼機(jī)構(gòu)自行定義。
現(xiàn)在我們再看一下上面給出的兩個(gè)例子:
70153002 為由SNOMED International自身發(fā)布的第70153個(gè)概念。
12271241000119109 為由Kaiser Permanente發(fā)布的概念。
從數(shù)據(jù)類別方面來講,SCTID的長度為18位,超出了32位整數(shù)格式,存儲(chǔ)時(shí)適合使用64位長整數(shù)格式或字符串格式。
19大類目
身體結(jié)構(gòu)(Body structure)
臨床發(fā)現(xiàn)(Clinical finding)
環(huán)境和地理定位(Environment or geographical location)
事件(Event)
觀察對象(Observable entity)
有機(jī)體(Organism)
藥物/生物產(chǎn)品(Pharmaceutical/biologic product)
物理力(Physical force)
物理對象(Physical object)
操作(Procedure)
限定值(Qualifier value)
人為記錄件(Record artifact)
具有明確語境的情況(Situation with explicit context)
SNOMED CT模型組件(SNOMED CT model component)
社會(huì)環(huán)境(Social context)
特殊概念(Special concept)
標(biāo)本(Specimen)
分期與分度(Staging and scales)
物質(zhì)(Substance)
每一個(gè)大類之下會(huì)包含不同的子類別和孫類別,譬如說疾病(disease)和藥物作用(drug action)就是臨床發(fā)現(xiàn)(clinical finding)的子類別。