ich habe auch eine praktikable Lösung gefunden. Wie du bereits erwähnt hast kann man ja die Regexp Terme mit einem oder verknüpfen. Das funktioniert ganz gut für mich.
Code: Alles auswählen
collagen <- c("\\bCOL1A1\\b|\\bCOL1A2\\b|\\bCOL2A1\\b|\\bCOL3A1\\b|\\bCOL4A1\\b|\\bCOL4A2\\b|\\bCOL4A3\\b|\\bCOL4A4\\b|\\bCOL4A5\\b|\\bCOL4A6\\b|\\bCOL5A1\\b|\\bCOL5A2\\b|\\bCOL5A3\\b|\\bCOL6A1\\b|\\bCOL6A2\\b|\\bCOL6A3\\b|\\bCOL6A4P1\\b|\\bCOL6A4P2\\b|\\bCOL6A5\\b|\\bCOL6A6\\b|\\bCOL7A1\\b|\\bCOL8A1\\b|\\bCOL8A2\\b|\\bCOL9A1\\b|\\bCOL9A2\\b|\\bCOL9A3\\b|\\bCOL10A1\\b|\\bCOL11A1\\b|\\bCOL11A2\\b|\\bCOL12A1\\b|\\bCOL13A1\\b|\\bCOL14A1\\b|\\bCOL15A1\\b|\\bCOL16A1\\b|\\bCOL17A1\\b|\\bCOL18A1\\b|\\bCOL19A1\\b|\\bCOL20A1\\b|\\bCOL21A1\\b|\\bCOL22A1\\b|\\bCOL23A1\\b|\\bCOL24A1\\b|\\bCOL25A1\\b|\\bCOL26A1\\b|\\bCOL27A1\\b|\\bCOL28A1|\\bOI4\\b|\\bOI4\\b|\\bSEDC\\b|\\b AOM\\b|\\b STL1\\b|\\bEDS4A\\b|\\bFLJ22259\\b|\\b DKFZp686I14213\\b|\\bCA44\\b|\\bASLN\\b|\\b ATS\\b|\\bDVWA\\b|\\b VWA6\\b|\\b DIVA\\b|\\b COL6A4\\b|\\b COL6A4P\\b|\\bCOL6A4\\b|\\bCOL29A1\\b|\\b FLJ35880\\b|\\b VWA4\\b|\\bEBDCT\\b|\\b EBD1\\b|\\b EBR1\\b|\\bC3orf7\\b|\\b MGC9568\\b|\\bFECD\\b|\\b PPCD\\b|\\b FECD1\\b|\\b PPCD2\\b|\\bEDM2\\b|\\b MED\\b|\\bIDD\\b|\\b MED\\b|\\b EDM3\\b|\\b FLJ90759\\b|\\b DJ885L7.4.1\\b|\\bCOLL6\\b|\\b STL2\\b|\\b CO11A1\\b|\\bDFNA13\\b|\\b DFNB53\\b|\\b HKE5\\b|\\bCOL12A1L\\b|\\bUND\\b|\\bBPAG2\\b|\\b BP180\\b|\\bKNO\\b|\\b KS\\b|\\b KNO1\\b|\\bKIAA1510\\b|\\bDKFZp434K0621\\b|\\bEMID2\\b|\\b Emu2\\b|\\b EMI6\\b|\\bKIAA1870\\b|\\b MGC11337\\b|\\b FLJ11895\\b") # Alle Collagene inklusive Aliase aus gennames.org
Barriere62 <- Daten[grepl(collagen, Daten$`Primary Sequence Name`),] # Genselection z.B.- Barriere etc
Werner