GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:28:04 Sequence gi568815597f:100033169_100248817 : 215649 bp : 38.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1079 1074 6 1.05 1.02 Term - 5847 5227 621 0 0 78 44 488 0.289 36.82 1.01 Init - 9841 9836 6 1 0 94 94 0 0.268 2.13 1.00 Prom - 11713 11674 40 -5.15 2.00 Prom + 13804 13843 40 -3.55 2.01 Init + 20624 20626 3 1 0 89 89 0 0.492 0.25 2.02 Intr + 25498 25560 63 0 0 102 97 68 0.984 7.10 2.03 Intr + 26710 26837 128 0 2 71 80 45 0.995 0.56 2.04 Intr + 28667 28781 115 2 1 127 84 51 0.996 8.13 2.05 Intr + 34809 35027 219 2 0 107 115 176 0.999 19.78 2.06 Intr + 36446 36517 72 1 0 69 109 21 0.660 0.98 2.07 Intr + 43997 44109 113 1 2 127 111 -8 0.962 3.66 2.08 Intr + 45283 45380 98 1 2 101 96 137 0.986 14.53 2.09 Intr + 47328 47491 164 1 2 31 91 105 0.835 3.87 2.10 Term + 48835 49041 207 0 0 87 49 38 0.758 -3.84 2.11 PlyA + 50171 50176 6 1.05 3.15 PlyA - 50424 50419 6 1.05 3.14 Term - 52266 52160 107 1 2 126 31 50 0.928 0.69 3.13 Intr - 52462 52368 95 0 2 64 115 35 0.936 2.49 3.12 Intr - 55068 54971 98 0 2 106 106 13 0.914 2.79 3.11 Intr - 69915 69787 129 0 0 102 63 52 0.923 4.07 3.10 Intr - 72735 72599 137 1 2 55 83 71 0.877 2.67 3.09 Intr - 73825 73744 82 1 1 63 84 51 0.900 0.49 3.08 Intr - 74385 74206 180 0 0 -21 53 207 0.925 5.44 3.07 Intr - 74549 74460 90 2 0 27 76 97 0.761 1.67 3.06 Intr - 74836 74642 195 1 0 130 19 197 0.999 15.69 3.05 Intr - 77315 77124 192 2 0 57 57 151 0.971 7.67 3.04 Intr - 85969 85850 120 1 0 30 110 125 0.995 8.67 3.03 Intr - 88381 88210 172 0 1 69 116 39 0.656 3.82 3.02 Intr - 89316 89212 105 2 0 68 115 37 0.708 2.81 3.01 Init - 99521 99382 140 2 2 102 31 156 0.502 10.97 3.00 Prom - 100984 100945 40 -7.85 4.00 Prom + 101095 101134 40 -8.15 4.01 Init + 106376 106382 7 1 1 76 92 5 0.732 0.57 4.02 Intr + 107697 107851 155 1 2 50 115 80 0.783 5.77 4.03 Intr + 108484 108653 170 0 2 53 63 96 0.594 1.42 4.04 Intr + 110875 110975 101 1 2 43 98 28 0.584 -1.87 4.05 Intr + 114726 115158 433 1 1 62 86 402 0.884 29.18 4.06 Term + 115457 115652 196 2 1 55 39 200 0.998 7.70 4.07 PlyA + 116170 116175 6 1.05 5.08 PlyA - 116639 116634 6 1.05 5.07 Term - 117669 117482 188 1 2 23 48 99 0.561 -3.93 5.06 Intr - 119356 119217 140 0 2 58 75 152 0.822 10.09 5.05 Intr - 123149 123004 146 2 2 84 93 115 0.994 9.76 5.04 Intr - 125199 125063 137 1 2 78 50 103 0.991 4.87 5.03 Intr - 126247 126091 157 1 1 67 91 64 0.684 3.26 5.02 Intr - 127403 127298 106 1 1 79 121 90 0.295 10.70 5.01 Init - 135348 135236 113 0 2 71 29 148 0.005 6.93 5.00 Prom - 146986 146947 40 -4.75 6.13 PlyA - 148883 148878 6 1.05 6.12 Term - 163254 163087 168 0 0 121 49 126 0.943 8.90 6.11 Intr - 167652 167482 171 0 0 69 5 154 0.884 4.42 6.10 Intr - 173133 173062 72 0 0 67 83 64 0.846 2.48 6.09 Intr - 173468 173277 192 2 0 78 70 160 0.912 11.97 6.08 Intr - 180497 180281 217 1 1 25 64 325 0.621 21.38 6.07 Intr - 181348 181323 26 1 2 78 43 4 0.765 -9.19 6.06 Intr - 181815 181649 167 1 2 79 100 170 0.949 16.06 6.05 Intr - 183031 182815 217 1 1 20 90 165 0.932 7.05 6.04 Intr - 185579 185458 122 0 2 46 101 145 0.692 10.89 6.03 Intr - 197746 197565 182 0 2 125 67 165 0.537 16.69 6.02 Intr - 202343 202268 76 0 1 40 115 98 0.994 5.35 6.01 Intr - 207716 207593 124 2 1 94 119 32 0.987 6.14 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 133091 133710 620 0 2 1 44 406 0.868 21.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:100033169_100248817|GENSCAN_predicted_peptide_1|208_aa MEAMTDSNTFDLKTHSDDFSLKTSRFECRPQVSAVMLPTPGSEGSPSPVRVYCKQPRAGA PTQRDVETESERTTGPQRDPRARRHPVSNIWESRDGSQPLDKRGDGSAGPGGEGEARLRR RGARRAGRRGSPAHSSPRRRRTWGSRPQGLQRRPEQRAAGAGKKEDGPRRPQARAAEDDS RSGGMRVRDPEAPAARTGWGKGGRHRDA >gi568815597f:100033169_100248817|GENSCAN_predicted_CDS_1|627_bp atggaggccatgactgattctaatacttttgatcttaagacacacagtgatgacttttct ttgaaaactagcaggttcgaatgtaggccccaggtgtctgccgtcatgcttccgactccg ggaagtgagggaagcccatctccggtcagggtttactgtaagcagcccagagccggagca ccgacgcagagagacgtggaaacagaaagcgaacgtactacggggccgcagcgggatccc cgcgccagacgccaccccgttagcaacatatgggagagcagggatggttcccagcccctt gacaagcgcggcgacggctccgcggggcctggcggcgaaggcgaggcgaggctgcggcga cgaggggcgcggcgggcgggccgccgggggagccccgcacattcctccccgcggcggagg cgcacgtggggttcgaggccgcagggcctgcagaggcggccagagcagagggcggcaggt gcgggaaagaaagaggacgggccccggaggccgcaagcccgagcagcggaggacgattcc cggagcggcggcatgcgggtccgcgaccccgaggcaccggcagctcggacagggtggggg aagggagggaggcaccgggacgcctag >gi568815597f:100033169_100248817|GENSCAN_predicted_peptide_2|393_aa MVLHETFPKHTFLMNGLIQGVKGLLSFLSAPLIGALSDVWGRKSFLLLTVFFTCAPIPLM KISPWWYFAVISVSGVFAVTFSVVFAYVADITQEHERSMAYGLVSATFAASLVTSPAIGA YLGRVYGDSLVVVLATAIALLDICFILVAVPESLPEKMRPASWGAPISWEQADPFAIMKF SPESVAAFIAVLGILSIIAQTIVLSLLMRSIGNKNTILLGLGFQILQLAWYGFGSEPWMM WAAGAVAAMSSITFPAVSALVSRTADADQQGVVQGMITGIRGLCNGLGPALYGFIFYIFH VELKELPITGTDLGTNTSPQHHFEQNSIIPGPPFLFGACSVLLALLVALFIPEHTNLSLR SSSWRKHCGSHSHPHNTQAPGEAKEPLLQDTNV >gi568815597f:100033169_100248817|GENSCAN_predicted_CDS_2|1182_bp atggtattacatgaaacctttcctaaacatacatttctgatgaacggcttaattcaagga gtaaagggtttgttgtcattccttagtgccccgcttattggtgctctttctgatgtttgg ggccgaaaatccttcttgctgctaacggtgtttttcacatgtgccccaattcctttaatg aagatcagcccatggtggtactttgctgttatctctgtttctggggtttttgcagtgact ttttctgtggtatttgcatacgtagcagatataacccaagagcatgaaagaagtatggct tatggactggtttcagcaacatttgctgcaagtttagtcaccagtcctgcaattggagct tatcttggacgagtatatggggacagcttggtggtggtcttagctacagcaatagctttg ctagatatttgttttatccttgttgctgtgccagagtcgttgcctgagaaaatgcggcca gcatcctggggagcacccatttcctgggaacaagctgacccttttgcgataatgaaattt tcaccagaaagtgttgcagcgtttatagcagtccttggcattctttccattattgcacag accatagtcttgagtttacttatgaggtcaattggaaataagaacaccattttactgggt ctaggatttcaaatattacagttggcatggtatggctttggttcagaaccttggatgatg tgggctgctggggcagtagcagccatgtctagcatcacctttcctgctgtcagtgcactt gtttcacgaactgctgatgctgatcaacagggtgtcgttcaaggaatgataacaggaatt cgaggattatgcaatggtctgggaccggccctctatggattcattttctacatattccat gtggaacttaaagaactgccaataacaggaacagacttgggaacaaacacaagccctcag caccactttgaacagaattccatcatccctggccctcccttcctatttggagcctgttca gtactgctggctctgcttgttgccttgtttattccggaacataccaatttaagcttaagg tccagcagttggagaaagcactgtggcagtcacagccatcctcataatacacaagcgcca ggagaggccaaagaacctttactccaggacacaaatgtgtga >gi568815597f:100033169_100248817|GENSCAN_predicted_peptide_3|613_aa MAVLARGIHLAPRCRVGPPRKRIGNPEGPVVTSLTKSAVAEGTGGRVLKFQQGLLVDFLA FPQKFIDLLQQCTQEHAKEIPRFLLQLVSPAAILDNSPAFLNVVETNPFKHLTHLSLKLL PGNDVEIKKFLAGCLKCSKTLAEKKQELDKLRNEWASHTAALTNKHSQELTNEKEKALQA QVQYQQQHEQQKKDLEILHQQNIHQLQNRLSELEAANKDLTERKYKGDSTIRELKAKLSG VEEELQRTKQEVLSLRRENSTLDVECHEKEKHVNQLQTKVAVLEQEIKDKDQLVLRTKEA FDTIQEQKVVLEENGEKNQVQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLK NTVTIQQEKLLAEKEEKLQKEQKELQDVGQSLRIKEQEVCKLQEQLEATVKKLEESKQLL KNNEKLITWLNKELNENQLVRKQDVLGPSTTPPAHSSSNTIRSGISPNLNVVDGRLTYPT CGIGYPVSSAFAFQNTFPHSISAKNTSHPGSGTKVQFNLQFTKPNASLGDVQSGATISMP CSTDKENGENVGLESKYLKKREDSIPLRGLSQNLFSNSDHQRDGTLGALHTSSKPTALPS ASSAYFPGQLPNS >gi568815597f:100033169_100248817|GENSCAN_predicted_CDS_3|1842_bp atggcggttctggcccgtgggatccaccttgcacctaggtgtcgagtcggcccccctagg aagcggatagggaacccagagggcccggtggttacctctctcaccaagtcggccgttgca gaggggactggaggaagggttttaaaattccagcaaggtcttctggtagacttcttagct ttcccacaaaaatttatagatctccttcagcaatgtactcaagaacatgccaaagaaatt ccaaggtttttgctacagttagtttctccagcagctattttggataactcacctgcattt ttaaatgtggtagagacaaatccttttaagcatcttacacacctctcactaaaactttta cctggaaatgatgtggagataaagaaatttctcgcaggctgtttgaaatgtagcaagaca ttagcagaaaaaaaacaagaattagataagttacggaatgaatgggcgtcacatacagca gccttgacaaacaagcattctcaggaactgacaaatgaaaaggaaaaagccttgcaggca caggttcaatatcaacagcagcatgaacaacagaaaaaagatttagaaatcctccatcaa caaaacatccaccagctacaaaacagactgtctgagttagaagcggctaataaagactta accgaaagaaaatataaaggagactccactattagagaacttaaagcaaaactttctggt gttgaagaggagctacagcggactaagcaagaagtcctctctttgcgaagagagaattct acactagatgttgaatgccacgagaaagaaaagcacgttaatcagctacaaacaaaagtg gcagttttagaacaggaaatcaaggataaggaccagcttgttttaagaacaaaagaggca tttgatacaatccaggaacaaaaggtggttttagaagaaaatggtgagaaaaatcaagta caactaggaaagcttgaagctacaataaaatcattatctgcagaacttctgaaggcaaat gaaattatcaagaagttacaaggggatctgaaaactttaatgggtaagttgaaattgaag aatacagttactattcagcaagaaaaactcttggctgagaaggaggaaaaattacaaaag gaacaaaaggaattacaagatgttggacagtctcttcgaattaaagagcaagaggtatgc aaattacaagaacaattagaagctacagttaaaaaacttgaagaaagcaaacaacttcta aaaaataatgaaaagttaatcacgtggttaaataaagaactaaatgaaaatcagctagtg agaaagcaagatgtattgggaccttctactactccgcctgcacattccagcagcaacaca atcagaagtggaatttctcctaacctgaatgtggttgatggtagactgacttacccaacc tgtgggattggttatcctgtctcctctgcatttgcattccagaataccttccctcattcg atatctgccaaaaataccagccaccctggttcaggaacaaaggttcagtttaatttgcag tttacaaaaccaaatgcatcactaggagatgttcagtcaggagcaactattagtatgcct tgctcaactgataaggaaaatggtgaaaatgtagggttggaatccaaatacctgaagaaa agggaagatagcattcctttacgcggactcagccagaacctatttagtaattcagaccat cagagagatggcactttaggagcattacatacatcttccaaacccacagcgctcccctct gcgtcttcagcctatttccctgggcagttaccaaacagttaa >gi568815597f:100033169_100248817|GENSCAN_predicted_peptide_4|353_aa MAGNIENLKLLGPRRCFVEFGAGKGKLSHWVDIALKDAEKVHFILVEKVTTRFKKRDPGK YINDCSEVLKYEKGLHRVQHGFKEGSDKFYCREEMENDYVEKVMLELNLERQLILISISD KIPVLREEKLPVVGIGKHLCGMATDLALRCLVETYAASFEERNEEPLAKRIKNDKTEKEI YTLAKEGNEKNVPEKWNPVAGIVIALCCHHRCDWRHYVGKEYFRALGLGAVEFHYFQRMS SWATCGMRKTSLETSNSTTKRQDNQNDDSEEHDDGGYRITDDGADCLPGLLSVEEKKKIG HLCKLLIDQGRIQYLQQKGFSPALQYYTDPLVSLENVLLTALPNHSSSPETTA >gi568815597f:100033169_100248817|GENSCAN_predicted_CDS_4|1062_bp atggcaggtaacattgaaaatttaaagttacttggtccaagaagatgctttgttgagttt ggagcgggaaagggaaaattatctcattgggttgatattgccttaaaagatgctgaaaaa gttcacttcatcctagtggaaaaggtgaccacaagattcaagaaaagagacccaggcaaa tacataaatgattgtagtgaagtgttaaaatacgagaaaggtctgcacagagtacagcat ggtttcaaagaagggagtgacaagttctattgcagggaagagatggagaatgactatgta gaaaaggtgatgcttgaactaaatcttgaaagacagttaattctcatttccatttcagac aagattcctgtgctaagagaagaaaaactacctgtggtaggaattggaaagcatctgtgt ggtatggcaacagatcttgcattacgatgtttggttgaaacctatgctgccagttttgag gaaaggaatgaagaacctttagccaaacgcataaagaatgataaaacagaaaaagaaatt tacactttggccaaggaaggaaatgaaaaaaatgtcccagagaagtggaaccctgtggct ggcattgttattgcactctgttgtcaccacaggtgtgattggagacattatgtgggcaaa gaatatttcagggctctaggccttggagcagtggaattccattatttccagcgaatgagt agttgggcaacttgtgggatgcggaaaacatctttggaaacctcaaatagtaccacaaag aggcaagataatcagaatgatgatagtgaagagcatgatgatggaggatacagaatcaca gatgatggcgctgattgtttgcctgggcttcttagtgttgaagaaaagaagaaaataggg catctttgtaaattgctgattgaccaaggtcgaatccagtatttgcagcagaagggattc agtcctgctttgcagtactatacagaccctctggtgtctttggaaaatgttttgttaact gctttaccaaatcattcttcatcaccagaaacaactgcttaa >gi568815597f:100033169_100248817|GENSCAN_predicted_peptide_5|328_aa MTENVVCTGAVNAVKEVWEKRIKKLNEDLKREKEFQHKLVRIWEERVSLTKLREKVTRED GRVILKIEKEEWKTLPSSLLKLNQLQEWQLHRTGLLKIPEFIGRFQNLIVLDLSRNTISE IPPGIGLLTRLQELILSYNKIKTVPKELSNCASLEKLELAVNRDICDLPQELSNLLKLTH LDLSMNDFTTIPLAVLNMPALEWLDMGSNKLEQLPDTIERFVNFRDNPLKLKVSLPPSEG TDEEEERELFGLQFMHTYIQESRRRAVNLMHLWNMQVVVIVQVNLKFKKRFVESENSAGQ EIAVCISHSHFLMICEDKLTVRKTIILT >gi568815597f:100033169_100248817|GENSCAN_predicted_CDS_5|987_bp atgacagaaaatgtggtttgtactggggctgtcaatgctgtaaaggaagtttgggaaaaa agaataaagaaactcaatgaagacctgaagcgagagaaggaatttcaacacaagctagtg cggatctgggaagaacgagtaagcttaaccaagctaagagaaaaggtcaccagggaagat ggaagagtcattttgaagatagaaaaagaggaatggaagaccctcccttcttctctgctg aaactgaatcaactacaggaatggcaacttcatagaactggtttgctgaaaattcctgaa ttcattggaagattccagaacctcattgtgttagatttatctcgaaacacaatttcagag ataccaccagggattggactgcttactagacttcaggaactgattctcagctacaacaaa atcaagactgtccccaaggaactaagtaattgtgccagcttggagaaactagaactggct gttaacagagatatatgtgatcttccacaagagctcagcaatctgctaaaacttactcac cttgatctgagtatgaacgattttactacaatccctcttgctgtgttgaacatgcctgcc cttgagtggctggacatgggaagcaacaaacttgaacaacttcctgatactatagaaagg tttgtcaacttcagagacaacccactgaaattgaaagtatcacttcctcccagtgaaggc acagatgaagaagaggaacgggaattatttggccttcagtttatgcacacatacatacaa gagtcacggagaagagcagtaaatttgatgcacctgtggaacatgcaagttgtggtaatt gtgcaggtgaatctgaagttcaagaagaggtttgttgaatctgaaaattctgcaggacaa gaaatagctgtatgcatatcccacagtcactttctcatgatatgtgaagacaaattaaca gttaggaaaacaataatcctgacctaa >gi568815597f:100033169_100248817|GENSCAN_predicted_peptide_6|577_aa ICVRYFQTCGNVHVLKPNYVCFFGYPSFKYSHPHHFLKTTAALRGQVVQFKLSDIGEGIR EVTVKEWYVKEGDTVSQFDSICEVQSDKASVTITSRYDGVIKKLYYNLDDIAYVGKPLVD IETEALKDSEEDVVETPAVSHDEHTHQEIKGRKTLATPAVRRLAMENNIKLSEVVGSGKD GRILKEDILNYLEKQTGAILPPSPKVEIMPPPPKPKDMTVPILVSKPPVFTGKDKTEPIK GFQKAMVKTMSAALKIPHFGYCDEIDLTELVKLREELKPIAFARGIKLSFMPFFLKPPKV LEFQAEGGSTVGASSLPQGKTANEPKGEQDDGQEDAQEGEAVLQPPDPADGTASHDHDRV GRIADDGPAVDVVDPGMASHNIGIAMDTEQGLIVPNVKNVQICSIFDIATELNRLQKLGS VSQLSTTDLTGGTFTLSNIGSIGGTFAKPVIMPPEVAIGALGSIKSGPSSAGLLEFAGGP FQTLFAWVSPVEAVEQQRLLPASSSGSFIPEGHLPDASESSPAIPRFNQKGEVYKAQIMN VSWSADHRVIDGATMSRFSNLWKSYLENPAFMLLDLK >gi568815597f:100033169_100248817|GENSCAN_predicted_CDS_6|1734_bp atttgtgttcgctattttcaaacatgtggtaatgttcatgttttgaagccaaattatgtg tgtttctttggttatccttcattcaagtatagtcatccacatcacttcctgaaaacaact gctgctctccgtggacaggttgttcagttcaagctctcagacattggagaagggattaga gaagtaactgttaaagaatggtatgtaaaagaaggagatacagtgtctcagtttgatagc atctgtgaagttcaaagtgataaagcttctgttaccatcactagtcgttatgatggagtc attaaaaaactctattataatctagacgatattgcctatgtggggaagccattagtagac atagaaacggaagctttaaaagattcagaagaagatgttgttgaaactcctgcagtgtct catgatgaacatacacaccaagagataaagggccgaaaaacactggcaactcctgcagtt cgccgtctggcaatggaaaacaatattaagctgagtgaagttgttggctcaggaaaagat ggcagaatacttaaagaagatatcctcaactatttggaaaagcagacaggagctatattg cctccttcacccaaagttgaaattatgccacctccaccaaagccaaaagacatgactgtt cctatactagtatcaaaacctccggtattcacaggcaaagacaaaacagaacccataaaa ggctttcaaaaagcaatggtcaagactatgtctgcagccctgaagatacctcattttggt tattgtgatgagattgaccttactgaactggttaagctccgagaagaattaaaacccatt gcatttgctcgtggaattaaactctcctttatgcctttcttcttaaagcctcccaaagtg ctggaattccaggccgaaggtggctccacagttggggcatcttcgctccctcaaggcaaa acagcaaatgaacccaaaggggaacaggatgatggccaggaagatgcccaggaaggtgaa gcagtcctgcagccccccgaccctgcagacgggacagcctcccacgaccacgatagagtt ggtaggatagcggatgacggtccagctgtggatgttgtagaccctgggatggcttctcat aacattgggatagcaatggatactgagcagggtttgattgtccctaatgtgaaaaatgtt cagatctgctctatatttgacatcgccactgaactgaaccgcctccagaaattgggctct gtgagtcagctcagcaccactgatcttacaggaggaacatttactctttccaacattgga tcaattggtggtacctttgccaaaccagtgataatgccacctgaagtagccattggggcc cttggatcaattaagtcaggcccctcttctgcaggtctgctggagtttgctgggggtcca ttccagaccctgtttgcctgggtatcaccagtggaggctgtagaacagcaaagattgctg cctgcttcttcctctggaagcttcatcccagaggggcacctgccagatgccagcgagagc tctcctgccattccccgatttaaccagaaaggagaagtatataaggcacagataatgaat gtgagctggtcagctgatcacagagttattgatggtgctacaatgtcacgcttctccaat ttgtggaaatcctatttagaaaacccagcttttatgctactagatctgaaatga