GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:25:50 Sequence gi568815586f:15782708_16002975 : 220268 bp : 38.92% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5624 5758 135 0 0 60 66 128 0.183 7.42 1.02 Intr + 6372 6510 139 1 1 52 59 122 0.113 4.30 1.03 Term + 20534 20936 403 2 1 -20 49 336 0.756 12.54 1.04 PlyA + 23057 23062 6 1.05 2.04 PlyA - 23657 23652 6 1.05 2.03 Term - 55842 55693 150 0 0 40 55 127 0.439 1.53 2.02 Intr - 62733 62647 87 0 0 68 98 55 0.854 3.75 2.01 Init - 66365 66267 99 2 0 62 115 54 0.908 5.83 2.00 Prom - 76585 76546 40 -4.15 3.00 Prom + 89022 89061 40 -4.75 3.01 Init + 94184 94415 232 1 1 38 14 201 0.498 6.17 3.02 Intr + 99613 99733 121 2 1 53 -26 191 0.363 2.83 3.03 Intr + 99886 100112 227 1 2 58 105 277 0.790 23.11 3.04 Intr + 100834 100969 136 2 1 47 94 170 0.999 12.21 3.05 Intr + 107221 107302 82 1 1 88 69 82 0.985 5.02 3.06 Intr + 107890 107962 73 0 1 101 115 8 0.986 2.86 3.07 Intr + 111340 111436 97 2 1 28 123 51 0.983 0.75 3.08 Intr + 112652 112789 138 2 0 109 95 31 0.955 4.56 3.09 Intr + 115175 115311 137 2 2 50 95 104 0.999 6.59 3.10 Intr + 117197 117346 150 0 0 101 99 104 0.998 12.01 3.11 Intr + 118240 118305 66 2 0 107 67 84 0.662 6.16 3.12 Intr + 120291 120436 146 1 2 31 95 98 0.320 3.88 3.13 Term + 121820 121885 66 1 0 58 47 43 0.050 -5.84 3.14 PlyA + 125476 125481 6 1.05 4.11 PlyA - 125527 125522 6 1.05 4.10 Term - 128524 128450 75 1 0 18 42 110 0.196 -3.74 4.09 Intr - 128821 128648 174 1 0 83 70 111 0.409 8.01 4.08 Intr - 129878 129801 78 2 0 33 110 68 0.095 2.33 4.07 Intr - 135038 134958 81 2 0 43 60 107 0.197 2.32 4.06 Intr - 146352 146210 143 1 2 52 78 117 0.284 6.25 4.05 Intr - 162774 162607 168 1 0 33 86 136 0.121 6.90 4.04 Intr - 164354 163206 1149 0 0 1 40 524 0.021 27.02 4.03 Intr - 166399 166131 269 0 2 -9 71 225 0.045 7.35 4.02 Intr - 178478 178386 93 1 0 131 93 21 0.005 5.06 4.01 Init - 184276 183993 284 1 2 53 41 234 0.152 11.76 4.00 Prom - 188795 188756 40 -6.25 5.05 PlyA - 188807 188802 6 -0.45 5.04 Term - 190131 189600 532 2 1 20 48 368 0.311 18.73 5.03 Intr - 191461 191430 32 1 2 80 66 26 0.236 -4.39 5.02 Intr - 193880 193704 177 2 0 42 78 148 0.615 8.39 5.01 Init - 200845 200702 144 1 0 66 53 114 0.585 5.87 5.00 Prom - 202981 202942 40 -5.65 6.03 PlyA - 203227 203222 6 1.05 6.02 Term - 205164 205021 144 0 0 65 48 144 0.749 5.03 6.01 Init - 207959 207840 120 2 0 45 82 132 0.617 8.54 6.00 Prom - 211201 211162 40 -4.55 7.00 Prom + 212588 212627 40 -5.95 7.01 Init + 217323 217378 56 2 2 60 97 23 0.296 1.21 7.02 Term + 218695 218881 187 1 1 108 42 119 0.421 5.28 7.03 PlyA + 218962 218967 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 18990 19066 77 2 2 50 80 70 0.907 3.01 S.002 Init - 135742 135620 123 1 0 67 86 84 0.828 6.32 S.003 Intr + 175481 175628 148 1 1 61 90 140 0.921 10.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_1|225_aa XVCGFQILTTASCPPRDSNSTTGPMGGGGFEKHCRRTVGAKILQTAHKGGFLKSRKSKPP DPGTGAFRQGALTSDPPSGPRLSPETPRAQTRDPLIDKEHYGKPVAEVTEEEKNDQELKE TQLIKAAPTMTTSSVFEDPTISKFTNMMMKGENKVLARSFMTQVLEAMRSTMMLLQGNRQ PLNTTPIPEVPNRFGTRNWFPPTGERGLMGEGWKGLMISSGTGFS >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_1|678_bp natgtctgcggatttcaaatcctaaccacggcgtcctgccccccgcgagattccaacagt actacgggccctatgggaggaggaggatttgagaaacactgccggagaacggtcggtgca aagattttgcaaacagcacacaaaggcggatttttaaaatcgagaaagtcaaagccgccg gaccccgggaccggcgcctttcgccaaggggctcttacctcggatccgccctcggggcca cggctttccccggagacgccgcgagcccagacgagagatcccttgattgacaaggaacat tatggcaagccagtggctgaagtaactgaggaggagaagaatgatcaggaactcaaggag actcagctcatcaaagctgctccaacaatgacaacgagctctgtgtttgaagaccccaca atcagtaaattcaccaacatgatgatgaagggagaaaacaaagtactagccagatccttc atgactcaggttctggaagctatgagaagtaccatgatgcttctgcaggggaacaggcaa ccattgaacacaacccctataccagaggtccccaaccgttttggcaccagaaactggttc ccaccgacaggggagagggggttgatgggagaggggtggaagggattgatgatttcatca ggcactggattctcataa >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_2|111_aa MVSPIYMFLYNLATIQEVKSVSPSWAIVAASNKELICLTISTKTSHQQEATCGVHILRSN GKSVAKDHTSHVSEHNIVYVAGVTAAASVGREIGRRGKVGSSNSFRAMPWQ >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_2|336_bp atggtatctcctatctatatgttcctctataaccttgccactatccaagaggtgaagtct gtgtccccttcctgggccattgtggcagcttcaaataaggaactgatctgccttactatt tccacaaaaactagtcatcagcaagaagctacctgtggggtgcacattctcaggtcaaat ggtaagtctgtagcaaaagaccatacatcacacgtctcagagcacaacatcgtgtatgta gctggggtgacagcagcagcctcagtaggaagagagattgggagaagaggcaaagttggc agcagcaacagcttcagggcaatgccatggcagtga >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_3|556_aa MRNSLESLSDRIEYVENRNSELEDKVFELTQTNKDKEKRIRKYEQSLQEVWGYRKRPNLR RIGVPEEEENSKILENIFTSTRRARPPANRHFRRVLCVIYVVTSCRCRCGRCESWLARPA LVSGRGPGAARGTAAEERKDNDDPQLASPVAGFAAAMAMRQTPLTCSGHTRPVVDLAFSG ITPYGYFLISACKDGKPMLRQGDTGDWIGTFLGHKGAVWGATLNKDATKAATAAADFTAK VWDAVSGDELMTLAHKHIVKTVDFTQDSNYLLTGGQDKLLRIYDLNKPEAEPKEISGHTS GIKKALWCSEDKQILSADDKTVRLWDHATMTEVKSLNFNMSVSSMEYIPEGEILVITYGR SIAFHSAVSLDPIKSFEAPATINSASLHPEKEFLVAGGEDFKLYKYDYNSGEELESYKGH FGPIHCVRFSPDGELYASGSEDGTLRLWQTVVGKTYGLWKCVLPEEDSGELAKPKIGFPE TTEEELVSIQLTKTSKQRKASAFQSYCLLKAETAVNNEENELAPVLEQLTNLVLPVQSFV YWFPAAAVTSDHKCVP >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_3|1671_bp atgcgaaattctctggaaagtctcagcgatagaattgaatacgtagaaaacagaaattca gagcttgaagacaaggtttttgaattaacccaaaccaacaaagacaaagaaaaaagaata agaaaatatgaacaaagcctccaagaagtctggggttacaggaaacgaccaaacctaaga agaattggtgttcctgaagaagaagagaattctaaaatcttggaaaacatattcacttct acccggcgagcccgccctcccgccaatcgtcatttccggcgggtgctctgcgtcatttac gtcgtcacttcctgccgatgccggtgtggacgctgtgaatcgtggctggcccgcccagcc ctagtgtcagggcgggggcctggagcagcccgaggcactgcagcagaagagagaaaagac aacgacgaccctcagctcgccagtccggtcgctggcttcgccgccgccatggcaatgaga cagacgccgctcacctgctctggccacacgcgacccgtggttgatttggccttcagtggc atcacgccttatgggtatttcttaatcagcgcttgcaaagatggtaaacctatgctacgc cagggagatacaggagactggattggaacatttttgggtcataaaggtgctgtttggggt gcaacactgaataaggatgccaccaaagcagctacagcagctgcagatttcacagccaaa gtgtgggatgctgtctcaggagatgaattgatgaccctggctcataaacacattgtcaag actgtggatttcacgcaggatagtaattatttgttaaccgggggacaggataaactgtta cgcatatatgacttgaacaaacctgaagcagaacctaaggaaattagtggtcatacttct ggtataaaaaaagctctgtggtgcagtgaggataaacagattctttctgctgatgacaaa actgttcgactttgggatcatgctactatgacagaagtgaaatctctaaattttaatatg tctgttagtagtatggaatatattcctgagggagagattttggttataacttatggacga tctattgcttttcatagtgcagtaagtttggacccaattaaatcctttgaagctcctgca accatcaattctgcatctcttcatcctgagaaagaatttcttgttgcaggcggtgaagat tttaaactttataagtatgattataatagtggagaagaattagaatcctacaagggacac tttggtcctattcactgtgtgagatttagtcctgatggagaactctatgccagtggttca gaagatggaacattgagactatggcaaactgtggtaggaaaaacgtatggcctttggaaa tgtgtgcttcctgaagaagatagtggtgagctggcaaagccaaagattggttttccagag acaacagaagaggagctagttagtatacaactgactaaaacaagcaagcagagaaaagca tcagccttccagagttactgtctgcttaaggcagaaacagcagtaaataatgaggaaaat gaattagctccagtgctggaacaactaactaacttggtgttacctgtacagtcctttgtg tattggtttcctgctgctgctgtaacaagtgaccacaaatgtgtaccttaa >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_4|837_aa MGKMQHGVVIQRNTSSGAGGPGSRWSWCSGGLVVFAIRNVTEISKDMTFPKESTPRLWKS HTGTESENHSSQILRNMDTSGMTEEVKELANETQGFCVAFIRSCMYCDKSPLTPFHSFSN LSSTIRSRLTPHTAGYPSETKLPEERSDSNICCSAIFAVLQPLLLTPRQTGSGVDLQQTP TDLQLRVLTVRRKTNKQKGHPHQNPICTSPSSKTKEIRIKKLTQNCSTTWKLNNLLLNNY WVHNEMKAEIKMFFETNENKVITYQNLWDTFKAVCRVKFIALNAHRKKQERSKIDTLTSQ LKELEKQERTHLTASRRQKITEIRAELKEIETQKTLRKINESRSWFFEKINKIDRLLARL IKKKTEKNQIDAIKKKGDITTNPTETQTTIREYYKHLYANKLENLEEMDKFLDTYILPRL NQEEVESLNRPITGSEIEAIINTLPTKKSPGPDRFTAEFYQRYKEELVAFLLKLFQSIEK EGILPDSFYEASIILIPKPSRDTTKRENFRPISLMNIDAKILNKILANRIQQHIKKLIHH DQVGFIPGMQGWFNVHKSINVIQHINRSKDKNHIISIDAEKAFDKIQQPFLLKILNKLEH KIPRNPTYKGCEGPLQGELQTTAQRNQRGHKQMEEHSMLMDRKNQYRENGHTAQAEPEAL VEEVTWLSQCSKGADSCPVFTENAAPVCRTGFNSYRSRRLRKSVEKLSSTKPLQSLVPKR SGTVGVKQRVTTTKITILQMKKLKRKVINLSEVTQRKPLGDPQPGHWAAAGPALVPAALN ARERGWGAPRAPYRARCPDCARTWRGQLRSCRLEYCSARKSRCSPPPSSPRGTEPER >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_4|2514_bp atgggtaaaatgcaacatggtgtagtgatacaacggaatacgagttctggagcaggtggc cccgggtccaggtggtcctggtgctcaggaggtttggttgtctttgccataaggaatgta actgaaatcagcaaggacatgacgtttccaaaggaaagcacaccaaggttgtggaaaagt cacactgggacagaatctgagaaccacagttctcagattctgagaaacatggacacttca gggatgacagaggaggtgaaggagcttgcaaatgagacgcaggggttttgtgtggctttt atcaggtcctgcatgtactgtgacaagtctcccctaactccatttcactccttttctaat ttatccagtaccatcaggagcagactgacaccccacacggccggttacccctctgagacg aagctcccagaggaacgatcagacagcaacatttgctgttcagcaatattcgccgttctg cagcctctgctgctgacaccaaggcaaacagggtctggagtggacctccagcaaactcca acagacctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaaggacat ccacaccaaaaccccatctgtacgtcaccatcatcaaagaccaaagaaatcaggattaag aaactcactcaaaactgctcaactacatggaaactgaacaacctgctcctgaataactac tgggtacataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaa gttataacataccagaatctctgggacacatttaaagcagtgtgtagagtgaaatttata gcactaaatgcccacaggaaaaagcaggaaagatctaaaattgacaccctaacatcacaa ttaaaagaactagagaagcaagagcgaacacatttaacagctagcagaaggcaaaaaata actgagatcagagcagaactaaaggagatagagacacaaaaaacccttcgaaaaatcaat gaatccaggagctggttttttgaaaagatcaacaaaattgatagactgctagcaagacta ataaagaagaaaacagagaagaatcaaatagatgcaataaaaaaaaaaggggatatcacc accaatcccacagaaacacaaactaccatcagagaatactataaacacctctatgcaaat aaactagaaaatctagaagaaatggataaattcctggacacatacatcctcccaagacta aaccaggaagaagttgaatccctgaatagaccaataacaggctctgaaattgaggcaata atcaataccttaccaaccaaaaaaagtccaggaccagacagattcacagccgaattctac cagaggtacaaggaggagctggtagcattccttctgaaactattccaatcaatagaaaaa gagggaatcctccctgactcattttatgaggccagcatcatcctgataccaaagcctagc agagacacaacaaaaagagagaattttagaccaatatccctgatgaacatcgatgcaaaa atcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccat gatcaagtgggcttcatccctgggatgcaaggctggttcaacgtacacaaatcaataaat gtaatccagcatataaacagaagcaaagacaaaaaccacattatctcaatagatgcagaa aaggcctttgacaaaattcaacagcccttcttgctaaaaattctcaataaattagagcat aaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaa accactgctcaacgaaatcaaagaggacacaaacaaatggaagaacattccatgctcatg gataggaagaatcaatatcgtgaaaatggccatactgcccaagctgaacccgaagccctg gtggaagaagtcacgtggctgagccagtgctctaaaggagctgactcctgtccagttttc acagagaacgctgctcccgtctgcaggacaggatttaacagctacagatctagaaggctg agaaaatctgtggaaaaactgtcctccacaaaaccacttcagtccctggtgccaaaaagg tccgggaccgttggtgtaaagcaaagggtaactaccactaaaataaccattttacagatg aagaaacttaagcgcaaagttattaatttgtccgaagtcacacagcgaaaacccctcggg gacccacagccagggcactgggcggcagcaggccccgcgctagtcccggcggcgctgaac gcgagggagaggggatggggagccccgcgggccccttaccgagctcggtgccccgattgt gcgcggacatggcgcgggcagctccggagctgccggctggaatattgctccgcaaggaaa tctcgctgctcccctcccccgtcttctccacgaggtaccgaaccggagcgctaa >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_5|294_aa MGQTAGGEAVSGEIIPEKGGTCAEPQDVELFGQKAFRAGGTERTKTQKKDDSELYLWELS TTDKGEQGFSQPHTAKKVERNLAPMSLGPREILGLHVATSWQHDGGEEIHTEVFKGNRLT SKSSTRSWSPAPLPLIPERRSLLLRVVSNFPLSPVLTAVARDHGHTPHTLASHHIHRSLS GCTLEVPDFCTMREMPLGHIMRQELEKISLEYIMRCLREVSFCYPQNFLGKEVVDNCIQE HIKQLHCKGALRDSQLAGMCAVSLKGELWGDQFTPQFWEQQEGPRGHWLPPVPH >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_5|885_bp atgggccaaacagcagggggagaagcagtgtcaggggaaataattcctgaaaaaggtggt acctgtgctgaaccccaagatgtggagttatttgggcaaaaggcattccgggcaggaggc acagaacgtacaaagactcaaaagaaagatgattctgaactctatctttgggagctctca accacagacaagggagagcagggattttctcagccacacactgccaagaaggtggagaga aacctagcccccatgtcactggggcctagagaaattcttggcctgcatgttgctacatca tggcaacatgatggaggcgaggaaatacacactgaagtatttaagggtaacagattaact tccaagtccagcactagatcgtggagtcccgcaccactcccactgatccctgaacgcaga tcccttctcctgagagttgtgagcaactttcccctgtccccagtactgactgcggtggct cgggaccatggccacactccccacaccttggcttcacatcacatacatcgttccctctct gggtgcacgctggaggtccccgacttctgcactatgcgtgaaatgcccctgggacacatc atgaggcaggaattggagaaaatttccctagagtacatcatgcgctgtctgcgggaagtc agcttctgctacccgcaaaacttcctgggcaaggaggtggtggacaactgcatccaggag cacattaaacagctgcattgcaagggggccctgcgggacagccagctggctgggatgtgt gcagtgtctctaaagggtgaactgtggggtgatcagttcaccccacagttctgggagcaa caagaagggccgcgaggccattggcttcctcctgtccctcactga >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_6|87_aa MTTHRHIKENNRHWGLLEFGGWKEEEDQKKRQMDTQINIWLCDPGQQCEPSAACRAKWVE QVQQPGAKLKQKCTEVSDWQSDTLKIL >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_6|264_bp atgacaacacatagacacatcaaggagaataacagacactggggtcttttggagtttgga gggtggaaggaggaagaggatcagaaaaaacgacaaatggatactcagattaacatctgg ctgtgcgacccaggccagcagtgtgagccaagtgcagcctgccgggccaagtgggtggaa caagtccagcaaccaggagcaaaactcaagcagaagtgcacggaggtttctgactggcaa agcgacaccctaaagatcctgtga >gi568815586f:15782708_16002975|GENSCAN_predicted_peptide_7|80_aa MEQSRCLWDISGGKFDCAWARVEKFSVLFETIDSEKRTTHTETRKRFVLSNKERVIQTCG VMVLGLCDVFDALQGGQFLG >gi568815586f:15782708_16002975|GENSCAN_predicted_CDS_7|243_bp atggagcagagtagatgcctgtgggacatatcggggggcaaatttgattgtgcttgggct agagtggaaaagttcagtgttttattcgagacaattgactcagagaaaagaacaactcac acagagaccagaaagagattcgtgctcagtaacaaagagagagtaatacagacttgtgga gtcatggtcttaggtctgtgtgacgtgtttgatgctctgcaaggtggtcagtttctgggg tga