GENSCAN 1.0	Date run: 7-Nov-116	Time: 22:02:54

Sequence gi568815595f:40210892_40412011 : 201120 bp : 41.01% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    796    876   81  0  0   64   68    75 0.239   4.12
 1.02 Intr +   4992   5054   63  2  0   63   94    49 0.036   1.00
 1.03 Intr +  17757  17884  128  2  2  120   76    47 0.000   5.26
 1.04 Intr +  20198  20403  206  2  2   77   98    85 0.000   6.32
 1.05 Intr +  21278  21403  126  0  0   51   64    68 0.520   0.43
 1.06 Intr +  23005  23392  388  2  1   69   95   277 0.502  19.42
 1.07 Intr +  30472  30523   52  1  1   61   37    71 0.000  -2.71
 1.08 Intr +  33495  33716  222  2  0   60   94   346 0.129  29.80
 1.09 Intr +  39331  39435  105  0  0   84   89    56 0.714   4.79
 1.10 Intr +  39548  39608   61  1  1   87   91    47 0.751   2.29
 1.11 Intr +  40990  41108  119  2  2   91   54    85 0.077   4.66
 1.12 Intr +  49845  49983  139  2  1   67   90    43 0.004   1.52
 1.13 Intr +  58534  58674  141  2  0   81   69   102 0.499   7.00
 1.14 Intr +  58825  59002  178  2  1   67   71   137 0.541   7.96
 1.15 Intr +  61229  61409  181  2  1  -11   -4   208 0.303   0.65
 1.16 Term +  61582  61815  234  0  0   72   45   257 0.888  15.14
 1.17 PlyA +  62374  62379    6                               1.05

 2.00 Prom +  64364  64403   40                              -5.75
 2.01 Sngl +  64830  65924 1095  2  0   60   39   257 0.726  14.73
 2.02 PlyA +  66605  66610    6                               1.05

 3.03 PlyA -  67053  67048    6                               1.05
 3.02 Term -  79774  79626  149  2  2   85   44   125 0.309   4.88
 3.01 Init -  88772  88721   52  2  1   64   67    61 0.763   2.97
 3.00 Prom -  90333  90294   40                              -4.15

 4.00 Prom +  92808  92847   40                              -6.25
 4.01 Init +  95636  95727   92  1  2   52   78    65 0.578   1.91
 4.02 Intr +  99193  99320  128  1  2   95  -61   120 0.739  -2.80
 4.03 Intr + 100002 100165  164  1  2   94   98   170 0.990  17.37
 4.04 Intr + 100579 100680  102  0  0   65  105    71 0.823   5.95
 4.05 Term + 118455 118625  171  2  0   48   41   133 0.041   1.44
 4.06 PlyA + 120565 120570    6                               1.05

 5.06 PlyA - 121701 121696    6                               1.05
 5.05 Term - 122348 121918  431  0  2   -6   39   300 0.587  10.18
 5.04 Intr - 135321 134515  807  1  0   67   49   361 0.225  20.59
 5.03 Intr - 137873 137842   32  1  2   35  111    47 0.466  -1.44
 5.02 Intr - 144195 144006  190  1  1   83   76    94 0.543   5.42
 5.01 Init - 145842 145719  124  0  1   83   36    85 0.206   3.18
 5.00 Prom - 149166 149127   40                              -5.75

 6.00 Prom + 150761 150800   40                              -1.75
 6.01 Init + 172488 172584   97  2  1   82   25    96 0.234   3.22
 6.02 Intr + 176309 176370   62  0  2   28  110    80 0.284   1.73
 6.03 Intr + 177155 177206   52  1  1  114   92    25 0.473   3.06
 6.04 Intr + 181132 181259  128  2  2  107  115   133 0.899  17.38
 6.05 Intr + 190003 190120  118  0  1   98  116   140 0.650  16.92
 6.06 Intr + 200921 201071  151  0  1   90   88    71 0.004   5.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  16871  16838   34  1  1   95  105     8 0.933   0.51
S.002 Intr -  18891  18729  163  1  1   40  116   127 0.811   8.81
S.003 Init +  20214  20403  190  2  1   72   98   122 0.849  10.86
S.004 Sngl -  29576  28785  792  2  0   68   48   269 0.978  14.55
S.005 Init -  55888  55775  114  1  0   67   98   118 0.965  10.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:40210892_40412011|GENSCAN_predicted_peptide_1|807_aa
MALSPTGFAAYGPKALKTLTVPVACQAEKREALAVHLELQLLSRVLEQVLEQPLRWRQEN
LSSAPGTCTHPFILKAFLPTLYYIKTYITHSAGITGMSHHAQPVQSLLPAGSWFPSEMVQ
VFKQTSGVPGEAVALSSDQLPGPPHSDLREDANAFRGQDTKPPEPTETWLLGQSSSYCGT
DRLRLRRSPLAHVSSPQNGLTEVLKVINATEELIAGSTGPWESPQVPPDRQKGMFPRGTD
QVRLDEQLTSLEENVRGYGGALSRVNVCEEEADEIVAYREECKDTHIHTHTTTTIANTFS
GRTLGWIPQKQTLDEDLCACDLLGNAPKGGCTKEDVALELVETTMVTQGPPTSPKSADMN
SNVALSLQVYLAAGTVYGLETQLTELEDAARCIHSGTDETHLADLEDQVATAAAQVHHAE
LQISDIESRISALTIAGLNIAPCVRFTRRRDQKQRTQVQTIDTSRQQRRKLPAPPVKAEK
IETSSVTTIKTFNHNFILQGSSTNRTKERKGTTKDLMNNTDWVGDLQRKNCIPIVLETGK
SKVKGPASSKGLLAVSSHGRRQKGLLEYLPVILKILPNSSRMPSSNVTSMMVPDSFLAQD
NPGITQIIEMASVLYHINPDRKLKPFTISLNTTTTKKTISFPLVADLNEGQLVNEGHQRM
VELEEAEERRGRLTPHTARYPSETKLPDERSGSNISCSTIFAVLQPLLLIPRQTGSGVDL
QQTPTDLQLRECSSSPAMEQSWMENDFNELREEGFRQSNFSELEEEVRTHHKEAKNLEKR
LKEWLTRRTSIEKSLNDLMELKTMAQE

>gi568815595f:40210892_40412011|GENSCAN_predicted_CDS_1|2424_bp
atggctttatctcccactggctttgctgcctatggtcccaaggctctaaaaactctcact
gtaccagtggcatgccaggctgaaaaaagagaagctttagctgtccacttagagctacag
ctcttatccagagtgcttgaacaggtgttggaacagcccctgaggtggagacaggaaaat
ctctcttctgctccaggcacatgtacacatccttttatcctcaaagcctttctgcccacc
ttatattatatcaagacttatatcacacacagtgctgggattacaggcatgagccaccac
gcccagcccgttcagtcacttctgccagctggaagctggttcccttctgaaatggtccag
gtctttaagcagacctcaggtgttccaggtgaagctgtggcgctctcctcagaccagctg
cctgggcctcctcactctgacttaagggaagatgcaaatgccttcaggggccaagacaca
aagccccctgaaccaacagaaacatggctattgggtcagtcaagcagctattgtggaaca
gacaggctaaggctaaggaggagccctctggcacatgtgtcctcaccccagaacggcttg
acagaagtcctgaaagtcatcaatgccacagaggagttgatagcaggatctacagggccc
tgggagtccccacaagtccctcctgacagacagaaggggatgtttcctcgtgggacagac
caagtgagactggatgagcagctgacttccctggaagaaaatgtaagagggtatggaggt
gccctgtctagggtgaatgtatgtgaagaagaagctgatgaaattgtagcttatcgtgaa
gagtgcaaggacacacacatacacacacacaccactaccactatagcaaatacattctca
ggaaggaccttaggttggattccccagaagcagaccctagatgaggatttgtgtgcatgt
gacttattagggaatgctcccaagggaggctgcactaaagaggatgtagcactagaactt
gtggagacaaccatggttactcagggacccccaaccagtcccaagtctgcagacatgaac
tcaaatgtagcactgtctctacaggtatacctggcagcaggcactgtgtatggactggag
acccagctgactgagctagaagatgccgcccgctgcatccacagtggcactgatgagacc
catctggcggatctggaggaccaggtggccacggctgcagcccaagtccaccatgctgaa
ctccagatttcagatattgagagccggatttcagccctgaccattgcaggattaaacata
gcaccatgtgtgcgcttcacaagaagacgggatcagaagcaaaggacccaggtacaaacc
atagatacatcaaggcagcaaaggaggaaactgcctgctccaccggtgaaagctgaaaaa
attgagacatcttcagtgactaccattaaaacatttaaccacaacttcattctccaaggc
tcctcaacaaacaggactaaggaaaggaaaggcaccaccaaggatttgatgaataacaca
gactgggtgggtgatttacaaagaaaaaactgtattcctatagttctggaaactgggaag
tctaaggtcaaagggcctgcatctagcaagggccttcttgctgtgtcatcccatggcaga
aggcagaagggtctcttggaataccttcctgtgatactgaaaatcctaccgaattcttca
agaatgcccagctcaaatgttacctctatgatggtccctgactccttcctagcccaggat
aatcctggcataacccagataatcgagatggcctctgtcttgtaccatatcaatccagac
agaaaactcaaacccttcaccatttcacttaatacaacaacaacaaagaaaacaatcagc
tttcccctagtagcagacctcaatgaaggacagctggtgaatgaagggcaccaacgaatg
gtggagttggaagaagcagaggagaggaggggcagactgacacctcacacggccaggtac
ccctctgagacgaagcttccagatgaacgatcaggcagcaatatttcctgttcaacaata
ttcgctgttctgcagcctctgctgctgatacccaggcaaacagggtctggagtggacctc
cagcaaactccaacagacctgcagctgagggaatgcagctcctcgccagcaatggaacaa
agctggatggagaatgactttaacgagttgagagaagaaggcttcagacaatcaaacttc
tccgagctagaggaggaagttcgaacccatcacaaagaagctaaaaaccttgaaaaaaga
ttaaaagaatggctaactagaagaaccagcatagagaagtccttaaatgacctgatggag
ctgaaaaccatggcacaagaatga

>gi568815595f:40210892_40412011|GENSCAN_predicted_peptide_2|364_aa
MSELPFTITSKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV
KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARTAKSILSQKNKAGGIMLP
DFKLHYKATVTKPAWYWYQNRDIDQWNRTEPSEIIPHIYHHLIFDKPDKNKQWGKDSLFN
QWCWENWLAICRKLKLNPFLTLYTKINSRWIKDLHVRAKTIKTIEENLGNTIQDTGMGKD
FMSKTPKAMATKAKIDKWDLIKLKSFCIAKETTIRVNRQPTEWEKIFAIYSSDKGLISRI
YKELKQIYKKKNKQPHQKVGEGYEQTLLKRRHFCSQQTHEKMLIITGHQTNANRNHNEIP
SHTS

>gi568815595f:40210892_40412011|GENSCAN_predicted_CDS_2|1095_bp
atgagtgaactcccattcacaattacttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaattataaaccactgctcaacgaaataaaa
gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtg
aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcactgccaagtcaatcctaagtcaaaagaacaaagctggaggcatcatgctacct
gacttcaaactacactacaaggctacagtaaccaaaccagcatggtactggtaccaaaac
agagatatagaccaatggaacagaacagagccctcagaaataataccacacatctaccac
catctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaat
caatggtgctgggaaaactggctagccatatgtagaaagctgaaactgaatcccttcctt
acactttatacaaaaattaattcaagatggattaaagacttacatgttagagctaaaacc
ataaaaaccatagaagaaaacctaggcaataccattcaggacacaggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcatagcaaaagaaactaccattagagtgaacaggcaacct
acagaatgggagaaaatttttgcaatctattcatctgacaaagggctaatatccagaatc
tacaaagaactcaaacaaatttacaagaaaaaaaacaaacaaccgcatcaaaaagtgggc
gaaggatatgaacagacacttctcaaaagaagacatttttgcagccaacagacacatgaa
aaaatgctcatcatcactggccatcagacaaatgcaaatcgaaaccacaatgagatacca
tctcacaccagttag

>gi568815595f:40210892_40412011|GENSCAN_predicted_peptide_3|66_aa
MDSKGHSDEVSDGTEEQVGMGSGPVAQAQYSLPGRVGGASSAVKSETQAEAPLATEVTER
IVCHYC

>gi568815595f:40210892_40412011|GENSCAN_predicted_CDS_3|201_bp
atggacagtaaaggccattctgatgaggtttcagatggaactgaagaacaagtgggcatg
ggatctgggccagtagcgcaagctcagtacagcctgccaggccgagtgggtggagccagc
tcagcagtcaaaagtgaaactcaggcagaggcaccactggccacagaggttaccgaaaga
atcgtgtgtcactactgttag

>gi568815595f:40210892_40412011|GENSCAN_predicted_peptide_4|218_aa
MLNSEHSSWGYSSVMPTNAGNLNGALSHPPRGALSRRPENGDGAPGDPFPEDSAGLRALL
SEPRPRALRHPLPHPFADATKGDDLLPAGTEDYIHIRIQQRNGRKTLTTVQGIADDYDKK
KLVKAFKKKFACNGTVIEHPEYGEVIQLQGDQRKNICQFLLEDSESEALDSDSGSTTFQL
YGLIKPKPASLSQPQFSQQQLKDNCASDQNDYSRHKMR

>gi568815595f:40210892_40412011|GENSCAN_predicted_CDS_4|657_bp
atgctgaattcagagcatagcagctggggctattcatcagttatgcccactaacgcagga
aatctgaatggtgcattatcacatccaccacgtggggctttgtctcggcgcccagaaaat
ggcgacggggcccctggcgaccccttccccgaggacagcgccggactccgagccctcctc
tcggagccccggccgcgggccctgcgccaccccctgccccacccctttgctgatgcaact
aagggtgacgacttactcccggcagggactgaggattacattcatataagaatccagcaa
cggaacggcagaaagacactgactactgttcagggcattgcagatgattatgacaaaaag
aaacttgtgaaagctttcaaaaagaaatttgcctgtaatggtactgtgattgaacatcct
gaatacggagaggttattcagcttcaaggtgaccaaagaaaaaacatctgccagtttctc
ttggaggactcagaatcagaagccctggattcagattcgggctctaccactttccagctt
tatggcctgatcaagccaaagccagccagtctctcacagcctcagttttctcagcagcaa
cttaaggataattgtgcctcggatcagaatgactattctcggcacaaaatgagatga

>gi568815595f:40210892_40412011|GENSCAN_predicted_peptide_5|527_aa
MLGELQCPEDFQHLHPSPADSVFFGGTNGRCSVQVVLLKNTVFSFFGLSSHGQAVVRTSC
VSGCLLASGSPTLHPPGAPLPLQPYSSEMRASWRSELWSITWESRPYLESNSSKAVLEVL
ARTIRQEKQIKGIQLGKEEVKLSLFTDDMIVYLENPIVSAQNLLKRISNFSKVSGYKINV
QKSQAFLYTNNRQTESQIMSELPFTIASNRIKYLGIRLTKDVKDLFKENYKPLLSEIEED
TNKWKNIPCSWTGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQRRAH
IAKSILSQKNKAGSITLPDFKLYYKATVTKTAWYWYQNRDTDQWKRTEPSEIKLTFNQIS
YYLTSGRPNCQYFRLLNFTQSNLTALQAAIWERPGGKAFLGTDLTFMGVNERFVPWSPYS
GVRVKWRRVNFSASVPFRGCTQSCVLDFPDHFIQSRTRLQGPQEMECADLHSNISSANNN
YTDEVQQALTVQSASSGSHASSHSLIHSSPQHRSSAGTCIFLFTLQA

>gi568815595f:40210892_40412011|GENSCAN_predicted_CDS_5|1584_bp
atgctgggggaacttcagtgtcctgaagacttccagcatcttcacccttctccagctgac
tctgtgttttttggaggaacaaatggaagatgctcagtccaggttgtccttctaaagaac
acagtcttctccttctttggactatcatctcatggacaggctgtggtccggacttcgtgt
gtaagtggttgcttattggctagtggcagccccacactacatcctccaggtgctcctctt
ccactgcagccttacagctcagaaatgagagcatcctggaggagtgaactgtggagcatc
acctgggaatccaggccttacctggaatctaatagctccaaagcagtgttggaagttctg
gccaggacaatcaggcaggagaaacaaataaagggtattcaattaggaaaagaggaagtc
aaattgtccctgtttacagatgacatgattgtatatctagaaaaccccatcgtctcagcc
caaaatctccttaagcggataagcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaacagaataaaatacctaggaatccgacttacgaag
gatgtgaaggacctcttcaaggagaactacaaaccactgctcagtgaaatagaagaggat
acaaacaaatggaagaacattccatgctcatggacaggaagaatcaatattgtgaaaatg
gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact
ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaagaagagcccac
attgccaagtcaatcctaagccaaaagaacaaagctggaagcatcacactacctgacttc
aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat
acagaccaatggaaaagaacagagccctcagaaattaagctgacttttaatcaaatttct
tattacctgacttcaggcaggccaaactgccaatatttccggcttttgaactttacccaa
agtaaccttacagcccttcaagcagccatctgggaaagaccaggtggaaaggcgttcttg
gggacagacctaactttcatgggtgtcaatgagcgctttgtcccgtggtctccctattct
ggtgtacgtgtgaagtggaggagggtcaatttctcagcttctgtccccttcagggggtgt
actcagagctgcgtgttagacttccctgaccattttatccagagcagaaccaggctccaa
ggaccccaggaaatggagtgcgcggaccttcacagtaatatcagcagtgctaacaataac
tataccgacgaagttcagcaagcactcactgtgcagagtgcaagctcaggcagtcatgca
tcatcacactcactcattcactcctccccgcaacaccgcagctctgccggcacttgcatc
ttccttttcactctccaggcctag

>gi568815595f:40210892_40412011|GENSCAN_predicted_peptide_6|203_aa
MGPVGLGKWIREEENGHREMCETVAKIQKSVHALAAGRLCGSASRLLRIGSAQLGEKMFT
VLTRQPCEQAGLKALYRTPTIIALVVLLVSIVVLVSITVIQIHKQEVLPPGLKYGIVLDA
GSSRTTVYVYQWPAEKENNTGVVSQTFKCSVKGSGISSYGNNPQDVPRAFEECMQKVKGQ
VPSHLHGSTPIHLGATAGMRLLS

>gi568815595f:40210892_40412011|GENSCAN_predicted_CDS_6|609_bp
atggggcctgttggcttgggaaaatggatcagagaggaagaaaatggacacagagagatg
tgtgagaccgttgccaagatccagaaaagtgtgcatgcgctggccgcgggccgcctctgc
ggcagcgctagtcgccttctccgaatcggctccgcacagctaggagaaaagatgttcact
gtgctgacccgccaaccatgtgagcaagcaggcctcaaggccctctaccgaactccaacc
atcattgccttggtggtcttgcttgtgagtattgtggtacttgtgagtatcactgtcatc
cagatccacaagcaagaggtcctccctccaggactgaagtatggtattgtgctggatgcc
gggtcttcaagaaccacagtctacgtgtatcaatggccagcagaaaaagagaataatacc
ggagtggtcagtcaaaccttcaaatgtagtgtgaaaggctctggaatctccagctatgga
aataacccccaagatgtccccagagcctttgaggagtgtatgcaaaaagtcaaggggcag
gttccatcccacctccacggatccacccccattcacctgggagccacggctgggatgcgc
ttgctgagn