GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:07:39 Sequence gi568815596r:208021676_208224356 : 202681 bp : 42.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1779 1890 112 0 1 52 33 149 0.717 2.85 1.02 PlyA + 2470 2475 6 1.05 2.03 PlyA - 3196 3191 6 1.05 2.02 Term - 9868 9642 227 2 2 12 36 226 0.726 5.76 2.01 Init - 10278 10116 163 0 1 76 -20 206 0.827 8.54 2.00 Prom - 20644 20605 40 -6.35 3.03 PlyA - 22099 22094 6 1.05 3.02 Term - 25801 25733 69 1 0 49 42 100 0.433 -1.54 3.01 Init - 29363 29202 162 2 0 67 71 129 0.366 8.88 3.00 Prom - 31400 31361 40 -7.05 4.02 PlyA - 31510 31505 6 1.05 4.01 Sngl - 36070 35729 342 1 0 90 48 401 0.994 32.08 4.00 Prom - 36500 36461 40 -7.65 5.00 Prom + 36775 36814 40 -3.65 5.01 Init + 38277 38338 62 2 2 87 67 48 0.568 3.39 5.02 Intr + 40144 40299 156 1 0 108 73 27 0.009 1.40 5.03 Intr + 49037 49112 76 2 1 84 47 50 0.000 -0.90 5.04 Intr + 55790 55922 133 1 1 67 84 120 0.010 8.80 5.05 Term + 84765 84964 200 1 2 85 52 145 0.430 7.18 5.06 PlyA + 86130 86135 6 1.05 6.05 PlyA - 86511 86506 6 1.05 6.04 Term - 86821 86549 273 1 0 75 44 269 0.993 15.69 6.03 Intr - 90674 90591 84 2 0 13 94 165 0.996 8.70 6.02 Intr - 90833 90738 96 2 0 113 70 212 0.553 21.19 6.01 Init - 90947 90939 9 2 0 100 105 10 0.999 3.89 6.00 Prom - 91011 90972 40 -3.95 7.05 PlyA - 91133 91128 6 1.05 7.04 Term - 92128 91862 267 1 0 30 36 204 0.051 3.91 7.03 Intr - 100183 100034 150 1 0 84 -6 349 0.035 24.54 7.02 Intr - 102679 102437 243 1 0 118 94 482 0.999 48.57 7.01 Init - 102798 102790 9 0 0 100 105 10 0.989 3.89 7.00 Prom - 102884 102845 40 -4.85 8.06 PlyA - 103144 103139 6 1.05 8.05 Term - 106800 106528 273 0 0 -15 38 308 0.881 9.99 8.04 Intr - 108008 107766 243 2 0 90 89 355 0.976 32.57 8.03 Intr - 108158 108109 50 0 2 27 105 82 0.046 1.28 8.02 Intr - 114244 113876 369 2 0 86 36 344 0.061 23.15 8.01 Init - 116022 115917 106 0 1 54 77 67 0.703 2.64 8.00 Prom - 116675 116636 40 -7.55 9.07 PlyA - 118802 118797 6 1.05 9.06 Term - 121238 120963 276 2 0 70 45 173 0.915 5.68 9.05 Intr - 124341 124099 243 0 0 91 117 349 0.999 34.87 9.04 Intr - 125986 125761 226 0 1 36 47 140 0.002 1.76 9.03 Intr - 139401 139196 206 0 2 58 76 171 0.044 9.98 9.02 Intr - 141771 141529 243 0 0 95 84 436 0.999 40.67 9.01 Init - 141881 141873 9 2 0 102 105 10 0.989 4.09 9.00 Prom - 147358 147319 40 -4.35 10.07 PlyA - 148185 148180 6 1.05 10.06 Term - 150634 150491 144 1 0 45 44 143 0.503 2.53 10.05 Intr - 159141 159070 72 0 0 89 116 46 0.916 6.18 10.04 Intr - 159630 159543 88 2 1 65 91 48 0.768 1.85 10.03 Intr - 161372 161290 83 2 2 93 93 27 0.641 1.22 10.02 Intr - 168396 168278 119 1 2 65 85 115 0.117 8.16 10.01 Init - 174335 174245 91 2 1 70 63 56 0.064 1.33 10.00 Prom - 174431 174392 40 -4.65 11.00 Prom + 184351 184390 40 -5.95 11.01 Init + 186559 186779 221 0 2 33 65 283 0.853 18.65 11.02 Term + 187038 187278 241 0 1 67 47 318 0.917 20.11 11.03 PlyA + 187359 187364 6 1.05 12.05 PlyA - 188310 188305 6 1.05 12.04 Term - 197531 197245 287 0 2 62 39 182 0.582 5.28 12.03 Intr - 199588 199459 130 1 1 83 54 79 0.892 3.35 12.02 Intr - 199937 199708 230 0 2 59 102 95 0.524 4.67 12.01 Intr - 201059 200869 191 1 2 91 34 118 0.290 5.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 39768 40197 430 0 1 109 43 117 0.894 3.09 S.002 Init - 51009 50886 124 0 1 71 96 84 0.908 7.88 S.003 Init + 55712 55922 211 1 1 60 84 132 0.856 9.00 S.004 Sngl - 92152 91862 291 1 0 49 36 220 0.864 8.30 S.005 Init - 108117 108109 9 0 0 100 105 10 0.893 3.89 S.006 Term - 114244 113868 377 2 2 86 41 347 0.920 23.72 S.007 Init - 126012 125761 252 0 0 51 47 165 0.859 6.09 S.008 Term - 139401 139129 273 0 0 58 43 201 0.949 7.09 S.009 Sngl + 184784 184969 186 1 0 29 54 250 0.985 10.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_1|37_aa XLVRVHGLLGTGPHNSERWTSHALLFELRLLSDRQQH >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_1|114_bp nnactagtacgggtccatggcctgttaggaactgggccacacaacagtgagcgatggacc agccacgcattgctgtttgagctccgcctcctgtcagatcggcagcagcattag >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_2|129_aa MKTILSNQTVDIPENVDITLKGRTVIVKGPRGGLWRDFSHINVELSLLRKKEAPAQKDEL ILEGNNIQLVSNSAASIQQATTVKNKDIRKILDGTYVSEKGTVQQAEDLRVVQLQKQDAG RLLRPICDI >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_2|390_bp atgaagaccattctcagcaaccagactgttgacattccagaaaatgtcgacatcactctg aagggacgcacagttatcgtgaagggccccagaggaggcctgtggagggacttcagtcac atcaatgtagaactcagccttcttaggaagaaagaggctccggcccagaaagatgaatta atccttgaaggaaacaacattcagcttgtttcaaattcagctgcttcgattcagcaagcc acaacagttaaaaacaaggatatcagaaaaattttggatggtacctatgtctctgaaaaa ggaacagttcagcaggctgaagatctaagagttgtccagctacagaaacaagatgctgga agactcctaagacctatttgtgatatttaa >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_3|76_aa MEQVQAEEMPDAYKTIRSPETYSLLREQHGGTNPMIQLPPPGPTLDTWRLLQLKAWKSKI KALADSVSGEGLLASL >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_3|231_bp atggagcaagtgcaagcagaggaaatgccagatgcttataaaaccatcagatctcctgag acttactcattattacgagaacagcatgggggaaccaatcccatgatccagttacctcca cctggtcccacccttgacacatggagattattacaactcaaggcttggaagtccaagatc aaggcactggcagattcggtgtctggtgaaggcctgcttgcttccctttag >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_4|113_aa MLQKFDPNGIKVVYLRCTGGEVSATSALDPKIGPLGLSPKKIGDDIAKAMGDWKGLRITV KLTIQNRQAQIEVVPSASALIIKALKEPPRDRKKQKNIKHSGNITFDEIINIA >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_4|342_bp atgctgcagaagttcgaccccaacgggatcaaagtcgtatacctgaggtgcactgggggt gaagtcagtgccacgtctgcgctggaccccaagatcggcccactgggtctgtctccaaaa aagattggtgatgacattgccaaggcaatgggtgactggaagggcctgagaattacagtg aaactgaccattcagaacagacaggcccagattgaggtagtgccttctgcctctgccctg atcatcaaagccctcaaggaaccaccaagagacagaaagaaacagaaaaacattaaacac agtgggaatatcacttttgatgagatcatcaacattgcttga >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_5|208_aa MVLASAWLLVNPQGAFIHGGRPPHPAIMLYRSTCLFLPELSLSLEWNLLKQGCGLSCNPL RVQLQEQFFTYGRKLLGSPTEHIRLHPISQKHGHILLLHEETHIVNFCSKNYCKNIPGKP KEFTDPLKEEFAAANSKTQLKNPLHAQTPVGLAANAFSGNFCTLPLTGGRVERLWEFVSL SPLASDRRNWCRNTSTQIVGLKEGEVEV >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_5|627_bp atggtgctggcatctgcttggcttctggtgaatcctcagggagcttttattcatggtgga agacctcctcaccctgcaattatgttgtatcgttcaacttgtttgtttctgcctgaactg tctctgtctctagaatggaatctcctcaagcagggatgtggtctttcttgtaatccacta agagttcagctccaagagcaattcttcacatacggaaggaagcttctaggaagtcccact gaacatatccgtttacatcccattagccagaaacatggccacatcctgctgctacatgag gagactcacattgtgaacttttgctccaagaactactgcaagaacataccaggaaaacca aaagaattcacagaccctttgaaagaagagtttgccgctgcaaactccaagacacagctg aaaaaccctctgcatgctcagacccctgtgggcttggctgcaaatgctttctctggaaac ttctgcacattacccctcactggggggagagtggaaaggctctgggagtttgtgtccctc tccccccttgccagtgacagaaggaactggtgtaggaacacaagcacccagatcgttggc ctcaaggagggagaggttgaagtgtga >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_6|153_aa MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSFLRRGDYADHQQWMGLSDSVRSCRL IPHASSHRLRIYEREDYRGQMVEITEDCSSLHDRFHLSEIHSFNVLEGSWVLYELPNYQG RQYLLRPGDCRWCQDWGATDARVGSLRRAVELY >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_6|462_bp atggggaagatcaccctctacgaggaccggggcttccagggccgccactacgaatgcagc agcgaccaccccaacctgcagccctacttgagccgctgcaactcgttcctgcgccgcggc gactatgccgaccaccagcagtggatgggcctcagcgactcggtccgctcctgccgcctc atcccccacgccagctcccacaggctcaggatctatgagcgagaggactacaggggccag atggtggagatcactgaggactgctcctctcttcacgaccgcttccacctcagtgagatc cactccttcaacgtgctggagggctcctgggtcctctacgagctgcccaactaccagggg cggcagtacctgctgaggccgggggactgcaggtggtgtcaggactggggggccacggat gcgagagtgggctccctaaggagagctgtggagctctactga >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_7|222_aa MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR GDYADHQQWMGLSDSVRSCRLIPHDRFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPG DYRRYQDWGATNARGTFGNVCRQFCLSRPDSNATGTQMVEARMLLNVLQCTGQPPHPADN FLGPKCQKSAAVEKPCSKESGTPGGTGSRQKEGRPQSLESEP >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_7|669_bp atggggaagatcaccctctacgaggaccggggcttccagggccgccactatgaatgcagc agcgaccaccccaacctgcagccctacttgagccgctgcaactcggcgcgcgtggacagc ggctgctggatgctctatgagcagcccaactactcgggcctccagtacttcctgcgccgc ggcgactatgccgaccaccagcagtggatgggcctcagcgactcggtccgctcctgccgc ctcatcccccacgaccgcttccgcttcaatgaaatccactccctcaacgtgctggagggc tcctgggtcctctacgagctgtccaactaccgaggacggcagtacctgctgatgccaggg gactataggcgctaccaggactggggggccacgaatgccagaggaacctttggcaatgtc tgcagacagttctgcttgtcacgaccggacagtaatgctaccggtacccagatggtagag gccaggatgctgctcaatgtcctgcaatgcacaggacagcctccccacccagcagataat tttttgggccccaaatgtcaaaagagtgctgcggttgagaaaccttgctctaaagagagt ggaacaccaggagggacagggtcccggcaaaaggaaggaagaccccagtccctggagtca gagccttaa >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_8|346_aa MEVQVSSGGAGELAAQDPASALSKAPGRANNCKLHEATGSQCIVLRYTKLTDARTSVRPM NAREKGTVLYTDFGLEAIILMGQGDMRQALNNLQSIFPGSGFSYSEKYVQGLRRAPPPAH EGDDPALCECPCQGNLQDSCSPMASGLLTRRCHWQHLPSPITLNSHHPCQPAMGKITFYE DRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQ WMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSEIRSLHVLE GCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_8|1041_bp atggaggtgcaagtgagtagcggtggtgcaggcgagcttgcagcccaagaccctgcctct gccctcagcaaggcccctggcagggccaacaactgcaagctgcatgaggccactggatcc cagtgcatagtcctccgctacacaaagctgaccgatgcccggaccagtgtgaggccaatg aatgctagagagaagggtacagttctgtacactgactttggcctagaagccatcatcctc atgggtcagggagacatgagacaggccctgaacaacttgcagtccatcttcccaggatct ggcttcagttacagcgagaagtatgttcagggtctgcgacgagccccaccccctgctcat gaaggagatgatccagcactgtgtgaatgcccatgtcaaggaaacctacaagattcctgc tcacctatggcatctgggctactcaccagaagatgtcattggcaacatcttccaagcccc atcacactgaactcgcatcatccgtgtcaaccagccatggggaagatcaccttctatgag gacagggccttccagggccgcagctacgaaaccaccactgactgccccaacctgcagccg tatttcagccgctgcaactccatccgggtggagagcggctgctggatgctctatgagcgt cccaactaccaaggtcaacaatacttgctgcggcgaggggagtaccccgactaccagcaa tggatgggcctcagcgactccatccgctcctgttgtctcatcccccaaacagtctcccac aggctgcggctgtacgagagggaagaccacaaaggcctcatgatggagctgagtgaagac tgccccagcatccaggaccgcttccacctcagcgagatccgttccctccacgtgctggag ggctgctgggtcctctacgagctgcccaactaccgggggcggcaatacctgctgaggccc caagagtacaggcggtgccaggactggggggccatggatgctaaggcaggctctttgcgg agagtggtggatttgtattaa >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_9|400_aa MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQYFLRR GKYPDYQHWMGLSDSVQSCRIIPHTSSHKLRLYERDDYRGLMSELTDDCACVPELFRLPE IYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRAWKEKDWKIRDKKVWGRGMCREIWDWA QRAKIVVSHVKAQCRASTIEEALNNQVEKMIWSFDSSQRLPSATLLLAITFYEDRAFQGR SYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRRGEYPDYQQWMGLSDS IRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCISVQDRFHLTEIHSLNVLEGSWILY EMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_9|1203_bp atggggaagatcaccttctacgaggaccgagactttcagggtcgctgctacaattgcatc agtgactgccccaacctgcgggtctacttcagccgctgcaactccatccgagtagacagc ggctgctggatgctctatgagcgtcccaattaccagggccaccagtacttcctgcgccga ggcaagtaccccgactatcagcactggatgggcctcagcgactcggtccaatcctgccgt ataattcctcataccagctcgcacaagttaaggctgtacgagagagatgactaccgaggc cttatgtctgagctcactgatgactgcgcctgtgttccagaactgttccgtctccctgag atctattccctccacgtactggagggctgctgggtcctctatgaaatgcccaactaccgg gggcggcagtatctgctgaggcctggggactacagaagggcctggaaggagaaagattgg aagattagggacaagaaggtctggggtagaggaatgtgcagggagatatgggattgggca caaagggcgaagatagttgtatcccatgtgaaggcccagtgtagggcatccaccattgaa gaggcactaaacaaccaagtagaaaaaatgatttggtcatttgattccagtcagcgtctg ccatcagccaccctactgctggcaatcaccttctacgaggacagggccttccagggccgc agctacgaatgcaccactgactgccccaacctacaaccctatttcagccgctgcaactcc atcagggtggagagcggctgctggatgatctatgagcgccccaactaccagggccaccag tacttcctgcggcgtggggagtaccctgactaccagcaatggatgggcctcagcgactcc atccgctcctgctgcctcatccccccgcactctggcgcttacagaatgaagatctacgac agagatgaattgaggggacaaatgtcagagctcacagacgactgtatctctgttcaggac cgcttccacctcactgaaattcactccctcaatgtgctggagggcagctggatcctctat gagatgcccaactacagggggaggcagtatctgctgaggccgggggagtacaggaggttt cttgattggggggctccaaatgccaaagttggctctcttagacgagtcatggatttgtac tga >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_10|198_aa MGNAKRKQHGPSLRAILHVLMVVLVQPVRPPTRKKHVFLMSVSTSLRCADASACLQKQRK NKSTGTGLRLAHYDLAISVALQWLDPSEDLTWLEWEELKIPLHGRPIYPNRREREAMILS SYAGILMNSIPIEEVFKIYGADSSADSGTIKGILGNGNQVQEQKAMCASSRSYTSRAPSM DCGQCVRWTVLNNWTQGF >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_10|597_bp atggggaatgcgaagagaaagcagcatggccccagcctgagggccattctccatgtcctg atggtggtcctggttcagcctgtgaggccaccaaccaggaaaaagcatgtgttcctgatg tctgtgagcaccagtctgagatgcgcggatgcttcagcatgcctccagaagcagagaaag aacaaaagcactggaaccggtctccggttggcacactatgacttggccatcagtgttgct ttgcaatggctggatccctcagaagacttaacttggctggagtgggaggaactgaaaata ccactccatggcagacccatatatccaaatcgtagagaacgagaagctatgattttatca tcttatgctggaatcttaatgaacagtatcccgattgaggaagtctttaaaatttatggg gctgattcttctgccgattctggtaccatcaagggcattctaggaaatgggaaccaagta caggaacaaaaggccatgtgtgcaagttcacgttcatatacttcacgtgccccatcaatg gactgtggacagtgtgtccgttggacagtactcaacaattggactcaaggattctga >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_11|153_aa MEQYITKGKVTIINLKRTREKFLLAARAIENPSEVNVMSSRNTGQGAVLKVAAVMGATLI AGSFTPRTFTHQIRDPGEIEKEEQATAGKAETQEEFQGEWTAPTPEFAATEPEVADWPEV AQVPSGPIQQVPTEDWSTQPAAETGLQLPCSGQ >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_11|462_bp atggaacagtacattacaaaaggaaaagtgaccatcatcaatctgaagaggacccgggag aagtttctgctggcagctcgtgccattgaaaacccctctgaggtcaatgtcatgtcctct aggaacactggccagggggctgtgctgaaggttgctgctgtcatgggagccactcttatt gctggtagcttcactcctagaaccttcactcaccagatccgagatcctggagagattgaa aaagaagagcaggccacagctggaaaagctgagacccaggaggaatttcagggtgaatgg actgctccaactcctgagttcgctgctactgagcctgaggttgcagactggcctgaagtt gcgcaggtgccctctgggcctattcagcaggtccccactgaagactggagcactcagcct gccgctgaaactggtctgcagctcccgtgctcaggccagtga >gi568815596r:208021676_208224356|GENSCAN_predicted_peptide_12|279_aa XTPIRAFPLAPLTPSTALLKLRAPAKLAPLPLPSAAIMIIHQDLISHDEMFCNIYKIQEM AVPGARLHPGEINSHVAHTKPVWWSLHTNTSEIWCRDSDRGTSLQRSIPCPPALCSVKKI HLRPQVLRPTSPRNISPILNRRQRRHILSMDPKTPAPVTDWEGSLPLVFNHCRDSSLIIH PRFRGRDILTKLSAPLTIPGVQLHLIAALLPNPMPPLRLPLIPPYLNPQVWDIATPSLAT GHMPITIPLKPNHPYPTQRQYPIPQHNLKGLKPVITHLL >gi568815596r:208021676_208224356|GENSCAN_predicted_CDS_12|840_bp nnaactcctattagggcttttccactggctcccctcacccccagcactgcactgctcaag ctcagggctcctgctaagctagccccgctgcctctcccttcagccgcaattatgattatt caccaggacctcatcagccacgatgagatgttctgcaacatttacaagatccaggagatg gctgtgcctggagcccgcctgcacccaggtgaaataaacagccatgttgctcacacaaag cctgtttggtggtctcttcacaccaacacaagtgaaatttggtgccgtgactcggatcgg gggacctcccttcagagatcaatcccttgtcctcctgctctttgctccgtgaaaaagatc cacctacgacctcaggtcctcagacccaccagcccaaggaacatctcaccaattttaaat cggagacaaaggagacacattttatccatggacccaaaaactccagcgccggtcacggac tgggaaggcagccttcccttggtgtttaatcattgcagggactcctctctgattattcat ccacgtttcagaggtcgagatattttaaccaaattatctgctcccctgactattcctgga gtacagctgcatctcattgctgcccttctccccaacccaatgcctcctttgcgtcttcct ctcatacccccctaccttaacccacaagtatgggacatcgctactccttccctggcaacc ggtcacatgcccattaccatcccattaaaacctaaccacccttaccccactcaacgccaa tatcccatcccacagcacaatttaaaaggcttgaagcctgttatcactcacctgctatag