GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:05:33 Sequence gi568815596r:208028206_208229685 : 201480 bp : 42.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 108 103 6 1.05 1.02 Term - 3338 3112 227 0 2 12 36 226 0.739 5.76 1.01 Init - 3748 3586 163 1 1 76 -20 206 0.826 8.54 1.00 Prom - 14114 14075 40 -6.35 2.03 PlyA - 15569 15564 6 1.05 2.02 Term - 19271 19203 69 2 0 49 42 100 0.434 -1.54 2.01 Init - 22833 22672 162 0 0 67 71 129 0.366 8.88 2.00 Prom - 24870 24831 40 -7.05 3.02 PlyA - 24980 24975 6 1.05 3.01 Sngl - 29540 29199 342 2 0 90 48 401 0.994 32.08 3.00 Prom - 29970 29931 40 -7.65 4.00 Prom + 30245 30284 40 -3.65 4.01 Init + 31747 31808 62 0 2 87 67 48 0.568 3.39 4.02 Intr + 33614 33769 156 2 0 108 73 27 0.009 1.40 4.03 Intr + 42507 42582 76 0 1 84 47 50 0.000 -0.90 4.04 Intr + 49260 49392 133 2 1 67 84 120 0.010 8.80 4.05 Term + 78235 78434 200 2 2 85 52 145 0.430 7.18 4.06 PlyA + 79600 79605 6 1.05 5.05 PlyA - 79981 79976 6 1.05 5.04 Term - 80291 80019 273 2 0 75 44 269 0.993 15.69 5.03 Intr - 84144 84061 84 0 0 13 94 165 0.996 8.70 5.02 Intr - 84303 84208 96 0 0 113 70 212 0.553 21.19 5.01 Init - 84417 84409 9 0 0 100 105 10 0.999 3.89 5.00 Prom - 84481 84442 40 -3.95 6.05 PlyA - 84603 84598 6 1.05 6.04 Term - 85598 85332 267 2 0 30 36 204 0.051 3.91 6.03 Intr - 93653 93504 150 2 0 84 -6 349 0.035 24.54 6.02 Intr - 96149 95907 243 2 0 118 94 482 0.999 48.57 6.01 Init - 96268 96260 9 1 0 100 105 10 0.989 3.89 6.00 Prom - 96354 96315 40 -4.85 7.06 PlyA - 96614 96609 6 1.05 7.05 Term - 100270 99998 273 1 0 -15 38 308 0.881 9.99 7.04 Intr - 101478 101236 243 0 0 90 89 355 0.976 32.57 7.03 Intr - 101628 101579 50 1 2 27 105 82 0.046 1.28 7.02 Intr - 107714 107346 369 0 0 86 36 344 0.061 23.15 7.01 Init - 109492 109387 106 1 1 54 77 67 0.703 2.64 7.00 Prom - 110145 110106 40 -7.55 8.07 PlyA - 112272 112267 6 1.05 8.06 Term - 114708 114433 276 0 0 70 45 173 0.915 5.68 8.05 Intr - 117811 117569 243 1 0 91 117 349 0.999 34.87 8.04 Intr - 119456 119231 226 1 1 36 47 140 0.002 1.76 8.03 Intr - 132871 132666 206 1 2 58 76 171 0.044 9.98 8.02 Intr - 135241 134999 243 1 0 95 84 436 0.999 40.67 8.01 Init - 135351 135343 9 0 0 102 105 10 0.989 4.09 8.00 Prom - 140828 140789 40 -4.35 9.07 PlyA - 141655 141650 6 1.05 9.06 Term - 144104 143961 144 2 0 45 44 143 0.503 2.53 9.05 Intr - 152611 152540 72 1 0 89 116 46 0.916 6.18 9.04 Intr - 153100 153013 88 0 1 65 91 48 0.768 1.85 9.03 Intr - 154842 154760 83 0 2 93 93 27 0.641 1.22 9.02 Intr - 161866 161748 119 2 2 65 85 115 0.117 8.16 9.01 Init - 167805 167715 91 0 1 70 63 56 0.064 1.33 9.00 Prom - 167901 167862 40 -4.65 10.00 Prom + 177821 177860 40 -5.95 10.01 Init + 180029 180249 221 1 2 33 65 283 0.853 18.65 10.02 Term + 180508 180748 241 1 1 67 47 318 0.917 20.11 10.03 PlyA + 180829 180834 6 1.05 11.06 PlyA - 181780 181775 6 1.05 11.05 Term - 191001 190715 287 1 2 62 39 182 0.582 5.28 11.04 Intr - 193058 192929 130 2 1 83 54 79 0.891 3.35 11.03 Intr - 193407 193178 230 1 2 59 102 95 0.523 4.67 11.02 Intr - 194428 194339 90 2 0 94 34 85 0.027 2.75 11.01 Intr - 199873 199738 136 1 1 67 45 133 0.027 6.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 33238 33667 430 1 1 109 43 117 0.894 3.09 S.002 Init - 44479 44356 124 1 1 71 96 84 0.908 7.88 S.003 Init + 49182 49392 211 2 1 60 84 132 0.856 9.00 S.004 Sngl - 85622 85332 291 2 0 49 36 220 0.864 8.30 S.005 Init - 101587 101579 9 1 0 100 105 10 0.893 3.89 S.006 Term - 107714 107338 377 0 2 86 41 347 0.920 23.72 S.007 Init - 119482 119231 252 1 0 51 47 165 0.859 6.09 S.008 Term - 132871 132599 273 1 0 58 43 201 0.949 7.09 S.009 Sngl + 178254 178439 186 2 0 29 54 250 0.985 10.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_1|129_aa MKTILSNQTVDIPENVDITLKGRTVIVKGPRGGLWRDFSHINVELSLLRKKEAPAQKDEL ILEGNNIQLVSNSAASIQQATTVKNKDIRKILDGTYVSEKGTVQQAEDLRVVQLQKQDAG RLLRPICDI >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_1|390_bp atgaagaccattctcagcaaccagactgttgacattccagaaaatgtcgacatcactctg aagggacgcacagttatcgtgaagggccccagaggaggcctgtggagggacttcagtcac atcaatgtagaactcagccttcttaggaagaaagaggctccggcccagaaagatgaatta atccttgaaggaaacaacattcagcttgtttcaaattcagctgcttcgattcagcaagcc acaacagttaaaaacaaggatatcagaaaaattttggatggtacctatgtctctgaaaaa ggaacagttcagcaggctgaagatctaagagttgtccagctacagaaacaagatgctgga agactcctaagacctatttgtgatatttaa >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_2|76_aa MEQVQAEEMPDAYKTIRSPETYSLLREQHGGTNPMIQLPPPGPTLDTWRLLQLKAWKSKI KALADSVSGEGLLASL >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_2|231_bp atggagcaagtgcaagcagaggaaatgccagatgcttataaaaccatcagatctcctgag acttactcattattacgagaacagcatgggggaaccaatcccatgatccagttacctcca cctggtcccacccttgacacatggagattattacaactcaaggcttggaagtccaagatc aaggcactggcagattcggtgtctggtgaaggcctgcttgcttccctttag >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_3|113_aa MLQKFDPNGIKVVYLRCTGGEVSATSALDPKIGPLGLSPKKIGDDIAKAMGDWKGLRITV KLTIQNRQAQIEVVPSASALIIKALKEPPRDRKKQKNIKHSGNITFDEIINIA >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_3|342_bp atgctgcagaagttcgaccccaacgggatcaaagtcgtatacctgaggtgcactgggggt gaagtcagtgccacgtctgcgctggaccccaagatcggcccactgggtctgtctccaaaa aagattggtgatgacattgccaaggcaatgggtgactggaagggcctgagaattacagtg aaactgaccattcagaacagacaggcccagattgaggtagtgccttctgcctctgccctg atcatcaaagccctcaaggaaccaccaagagacagaaagaaacagaaaaacattaaacac agtgggaatatcacttttgatgagatcatcaacattgcttga >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_4|208_aa MVLASAWLLVNPQGAFIHGGRPPHPAIMLYRSTCLFLPELSLSLEWNLLKQGCGLSCNPL RVQLQEQFFTYGRKLLGSPTEHIRLHPISQKHGHILLLHEETHIVNFCSKNYCKNIPGKP KEFTDPLKEEFAAANSKTQLKNPLHAQTPVGLAANAFSGNFCTLPLTGGRVERLWEFVSL SPLASDRRNWCRNTSTQIVGLKEGEVEV >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_4|627_bp atggtgctggcatctgcttggcttctggtgaatcctcagggagcttttattcatggtgga agacctcctcaccctgcaattatgttgtatcgttcaacttgtttgtttctgcctgaactg tctctgtctctagaatggaatctcctcaagcagggatgtggtctttcttgtaatccacta agagttcagctccaagagcaattcttcacatacggaaggaagcttctaggaagtcccact gaacatatccgtttacatcccattagccagaaacatggccacatcctgctgctacatgag gagactcacattgtgaacttttgctccaagaactactgcaagaacataccaggaaaacca aaagaattcacagaccctttgaaagaagagtttgccgctgcaaactccaagacacagctg aaaaaccctctgcatgctcagacccctgtgggcttggctgcaaatgctttctctggaaac ttctgcacattacccctcactggggggagagtggaaaggctctgggagtttgtgtccctc tccccccttgccagtgacagaaggaactggtgtaggaacacaagcacccagatcgttggc ctcaaggagggagaggttgaagtgtga >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_5|153_aa MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSFLRRGDYADHQQWMGLSDSVRSCRL IPHASSHRLRIYEREDYRGQMVEITEDCSSLHDRFHLSEIHSFNVLEGSWVLYELPNYQG RQYLLRPGDCRWCQDWGATDARVGSLRRAVELY >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_5|462_bp atggggaagatcaccctctacgaggaccggggcttccagggccgccactacgaatgcagc agcgaccaccccaacctgcagccctacttgagccgctgcaactcgttcctgcgccgcggc gactatgccgaccaccagcagtggatgggcctcagcgactcggtccgctcctgccgcctc atcccccacgccagctcccacaggctcaggatctatgagcgagaggactacaggggccag atggtggagatcactgaggactgctcctctcttcacgaccgcttccacctcagtgagatc cactccttcaacgtgctggagggctcctgggtcctctacgagctgcccaactaccagggg cggcagtacctgctgaggccgggggactgcaggtggtgtcaggactggggggccacggat gcgagagtgggctccctaaggagagctgtggagctctactga >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_6|222_aa MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR GDYADHQQWMGLSDSVRSCRLIPHDRFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPG DYRRYQDWGATNARGTFGNVCRQFCLSRPDSNATGTQMVEARMLLNVLQCTGQPPHPADN FLGPKCQKSAAVEKPCSKESGTPGGTGSRQKEGRPQSLESEP >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_6|669_bp atggggaagatcaccctctacgaggaccggggcttccagggccgccactatgaatgcagc agcgaccaccccaacctgcagccctacttgagccgctgcaactcggcgcgcgtggacagc ggctgctggatgctctatgagcagcccaactactcgggcctccagtacttcctgcgccgc ggcgactatgccgaccaccagcagtggatgggcctcagcgactcggtccgctcctgccgc ctcatcccccacgaccgcttccgcttcaatgaaatccactccctcaacgtgctggagggc tcctgggtcctctacgagctgtccaactaccgaggacggcagtacctgctgatgccaggg gactataggcgctaccaggactggggggccacgaatgccagaggaacctttggcaatgtc tgcagacagttctgcttgtcacgaccggacagtaatgctaccggtacccagatggtagag gccaggatgctgctcaatgtcctgcaatgcacaggacagcctccccacccagcagataat tttttgggccccaaatgtcaaaagagtgctgcggttgagaaaccttgctctaaagagagt ggaacaccaggagggacagggtcccggcaaaaggaaggaagaccccagtccctggagtca gagccttaa >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_7|346_aa MEVQVSSGGAGELAAQDPASALSKAPGRANNCKLHEATGSQCIVLRYTKLTDARTSVRPM NAREKGTVLYTDFGLEAIILMGQGDMRQALNNLQSIFPGSGFSYSEKYVQGLRRAPPPAH EGDDPALCECPCQGNLQDSCSPMASGLLTRRCHWQHLPSPITLNSHHPCQPAMGKITFYE DRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQ WMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSEIRSLHVLE GCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_7|1041_bp atggaggtgcaagtgagtagcggtggtgcaggcgagcttgcagcccaagaccctgcctct gccctcagcaaggcccctggcagggccaacaactgcaagctgcatgaggccactggatcc cagtgcatagtcctccgctacacaaagctgaccgatgcccggaccagtgtgaggccaatg aatgctagagagaagggtacagttctgtacactgactttggcctagaagccatcatcctc atgggtcagggagacatgagacaggccctgaacaacttgcagtccatcttcccaggatct ggcttcagttacagcgagaagtatgttcagggtctgcgacgagccccaccccctgctcat gaaggagatgatccagcactgtgtgaatgcccatgtcaaggaaacctacaagattcctgc tcacctatggcatctgggctactcaccagaagatgtcattggcaacatcttccaagcccc atcacactgaactcgcatcatccgtgtcaaccagccatggggaagatcaccttctatgag gacagggccttccagggccgcagctacgaaaccaccactgactgccccaacctgcagccg tatttcagccgctgcaactccatccgggtggagagcggctgctggatgctctatgagcgt cccaactaccaaggtcaacaatacttgctgcggcgaggggagtaccccgactaccagcaa tggatgggcctcagcgactccatccgctcctgttgtctcatcccccaaacagtctcccac aggctgcggctgtacgagagggaagaccacaaaggcctcatgatggagctgagtgaagac tgccccagcatccaggaccgcttccacctcagcgagatccgttccctccacgtgctggag ggctgctgggtcctctacgagctgcccaactaccgggggcggcaatacctgctgaggccc caagagtacaggcggtgccaggactggggggccatggatgctaaggcaggctctttgcgg agagtggtggatttgtattaa >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_8|400_aa MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQYFLRR GKYPDYQHWMGLSDSVQSCRIIPHTSSHKLRLYERDDYRGLMSELTDDCACVPELFRLPE IYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRAWKEKDWKIRDKKVWGRGMCREIWDWA QRAKIVVSHVKAQCRASTIEEALNNQVEKMIWSFDSSQRLPSATLLLAITFYEDRAFQGR SYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRRGEYPDYQQWMGLSDS IRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCISVQDRFHLTEIHSLNVLEGSWILY EMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_8|1203_bp atggggaagatcaccttctacgaggaccgagactttcagggtcgctgctacaattgcatc agtgactgccccaacctgcgggtctacttcagccgctgcaactccatccgagtagacagc ggctgctggatgctctatgagcgtcccaattaccagggccaccagtacttcctgcgccga ggcaagtaccccgactatcagcactggatgggcctcagcgactcggtccaatcctgccgt ataattcctcataccagctcgcacaagttaaggctgtacgagagagatgactaccgaggc cttatgtctgagctcactgatgactgcgcctgtgttccagaactgttccgtctccctgag atctattccctccacgtactggagggctgctgggtcctctatgaaatgcccaactaccgg gggcggcagtatctgctgaggcctggggactacagaagggcctggaaggagaaagattgg aagattagggacaagaaggtctggggtagaggaatgtgcagggagatatgggattgggca caaagggcgaagatagttgtatcccatgtgaaggcccagtgtagggcatccaccattgaa gaggcactaaacaaccaagtagaaaaaatgatttggtcatttgattccagtcagcgtctg ccatcagccaccctactgctggcaatcaccttctacgaggacagggccttccagggccgc agctacgaatgcaccactgactgccccaacctacaaccctatttcagccgctgcaactcc atcagggtggagagcggctgctggatgatctatgagcgccccaactaccagggccaccag tacttcctgcggcgtggggagtaccctgactaccagcaatggatgggcctcagcgactcc atccgctcctgctgcctcatccccccgcactctggcgcttacagaatgaagatctacgac agagatgaattgaggggacaaatgtcagagctcacagacgactgtatctctgttcaggac cgcttccacctcactgaaattcactccctcaatgtgctggagggcagctggatcctctat gagatgcccaactacagggggaggcagtatctgctgaggccgggggagtacaggaggttt cttgattggggggctccaaatgccaaagttggctctcttagacgagtcatggatttgtac tga >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_9|198_aa MGNAKRKQHGPSLRAILHVLMVVLVQPVRPPTRKKHVFLMSVSTSLRCADASACLQKQRK NKSTGTGLRLAHYDLAISVALQWLDPSEDLTWLEWEELKIPLHGRPIYPNRREREAMILS SYAGILMNSIPIEEVFKIYGADSSADSGTIKGILGNGNQVQEQKAMCASSRSYTSRAPSM DCGQCVRWTVLNNWTQGF >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_9|597_bp atggggaatgcgaagagaaagcagcatggccccagcctgagggccattctccatgtcctg atggtggtcctggttcagcctgtgaggccaccaaccaggaaaaagcatgtgttcctgatg tctgtgagcaccagtctgagatgcgcggatgcttcagcatgcctccagaagcagagaaag aacaaaagcactggaaccggtctccggttggcacactatgacttggccatcagtgttgct ttgcaatggctggatccctcagaagacttaacttggctggagtgggaggaactgaaaata ccactccatggcagacccatatatccaaatcgtagagaacgagaagctatgattttatca tcttatgctggaatcttaatgaacagtatcccgattgaggaagtctttaaaatttatggg gctgattcttctgccgattctggtaccatcaagggcattctaggaaatgggaaccaagta caggaacaaaaggccatgtgtgcaagttcacgttcatatacttcacgtgccccatcaatg gactgtggacagtgtgtccgttggacagtactcaacaattggactcaaggattctga >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_10|153_aa MEQYITKGKVTIINLKRTREKFLLAARAIENPSEVNVMSSRNTGQGAVLKVAAVMGATLI AGSFTPRTFTHQIRDPGEIEKEEQATAGKAETQEEFQGEWTAPTPEFAATEPEVADWPEV AQVPSGPIQQVPTEDWSTQPAAETGLQLPCSGQ >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_10|462_bp atggaacagtacattacaaaaggaaaagtgaccatcatcaatctgaagaggacccgggag aagtttctgctggcagctcgtgccattgaaaacccctctgaggtcaatgtcatgtcctct aggaacactggccagggggctgtgctgaaggttgctgctgtcatgggagccactcttatt gctggtagcttcactcctagaaccttcactcaccagatccgagatcctggagagattgaa aaagaagagcaggccacagctggaaaagctgagacccaggaggaatttcagggtgaatgg actgctccaactcctgagttcgctgctactgagcctgaggttgcagactggcctgaagtt gcgcaggtgccctctgggcctattcagcaggtccccactgaagactggagcactcagcct gccgctgaaactggtctgcagctcccgtgctcaggccagtga >gi568815596r:208028206_208229685|GENSCAN_predicted_peptide_11|290_aa RGVEGEALAGIGAACGAHGPVRVPGGRRLRGPYTWSSRPEPLAEGTAIMIIHQDLISHDE MFCNIYKIQEMAVPGARLHPGEINSHVAHTKPVWWSLHTNTSEIWCRDSDRGTSLQRSIP CPPALCSVKKIHLRPQVLRPTSPRNISPILNRRQRRHILSMDPKTPAPVTDWEGSLPLVF NHCRDSSLIIHPRFRGRDILTKLSAPLTIPGVQLHLIAALLPNPMPPLRLPLIPPYLNPQ VWDIATPSLATGHMPITIPLKPNHPYPTQRQYPIPQHNLKGLKPVITHLL >gi568815596r:208028206_208229685|GENSCAN_predicted_CDS_11|873_bp agaggtgtggagggagaggcgctggcgggaatcggggctgcgtgtggcgctcatgggcca gtgcgagttccgggtgggcgcaggctccgtgggccctacacttggagcagccggccggag ccactggccgagggcaccgcaattatgattattcaccaggacctcatcagccacgatgag atgttctgcaacatttacaagatccaggagatggctgtgcctggagcccgcctgcaccca ggtgaaataaacagccatgttgctcacacaaagcctgtttggtggtctcttcacaccaac acaagtgaaatttggtgccgtgactcggatcgggggacctcccttcagagatcaatccct tgtcctcctgctctttgctccgtgaaaaagatccacctacgacctcaggtcctcagaccc accagcccaaggaacatctcaccaattttaaatcggagacaaaggagacacattttatcc atggacccaaaaactccagcgccggtcacggactgggaaggcagccttcccttggtgttt aatcattgcagggactcctctctgattattcatccacgtttcagaggtcgagatatttta accaaattatctgctcccctgactattcctggagtacagctgcatctcattgctgccctt ctccccaacccaatgcctcctttgcgtcttcctctcatacccccctaccttaacccacaa gtatgggacatcgctactccttccctggcaaccggtcacatgcccattaccatcccatta aaacctaaccacccttaccccactcaacgccaatatcccatcccacagcacaatttaaaa ggcttgaagcctgttatcactcacctgctatag