GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:17:41 Sequence gi568815596r:202780863_203011476 : 230614 bp : 39.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5962 6001 40 -3.25 1.01 Init + 28161 28445 285 2 0 80 70 195 0.346 14.02 1.02 Term + 30394 30459 66 0 0 96 44 76 0.945 0.96 1.03 PlyA + 30865 30870 6 1.05 2.08 PlyA - 33029 33024 6 1.05 2.07 Term - 33922 33836 87 1 0 84 47 116 0.822 3.78 2.06 Intr - 35147 35049 99 2 0 99 83 73 0.993 7.29 2.05 Intr - 36681 36556 126 0 0 68 80 67 0.866 3.86 2.04 Intr - 39037 38839 199 0 1 86 16 204 0.886 11.43 2.03 Intr - 40619 40496 124 0 1 61 8 141 0.851 2.12 2.02 Intr - 44905 44833 73 1 1 29 60 72 0.601 -3.44 2.01 Init - 48147 47986 162 0 0 56 84 131 0.662 9.28 2.00 Prom - 48794 48755 40 -6.45 3.07 PlyA - 48930 48925 6 1.05 3.06 Term - 60495 59962 534 0 0 -89 50 739 0.117 45.66 3.05 Intr - 60947 60555 393 2 0 67 -12 359 0.170 17.72 3.04 Intr - 63807 63640 168 0 0 109 72 43 0.009 4.02 3.03 Intr - 69195 69029 167 1 2 82 5 138 0.174 3.66 3.02 Intr - 90911 90757 155 1 2 55 119 195 0.996 18.09 3.01 Init - 91014 90944 71 0 2 62 24 147 0.986 4.27 3.00 Prom - 94890 94851 40 -4.15 4.13 PlyA - 94983 94978 6 1.05 4.12 Term - 100075 99998 78 1 0 88 48 50 0.841 -2.22 4.11 Intr - 101921 101849 73 1 1 66 78 113 0.985 6.59 4.10 Intr - 102879 102747 133 1 1 96 94 118 0.999 11.98 4.09 Intr - 103441 103336 106 1 1 121 113 -9 0.999 3.77 4.08 Intr - 103673 103533 141 2 0 92 91 93 0.975 9.63 4.07 Intr - 115357 115203 155 2 2 109 106 20 0.860 4.77 4.06 Intr - 116553 116438 116 2 2 80 72 134 0.952 10.17 4.05 Intr - 118775 118669 107 2 2 68 27 85 0.931 -1.51 4.04 Intr - 120257 120163 95 0 2 70 89 91 0.880 6.16 4.03 Intr - 127097 127003 95 1 2 99 64 127 0.659 10.09 4.02 Intr - 130683 130574 110 0 2 13 100 76 0.232 -0.54 4.01 Init - 131756 131655 102 2 0 59 36 136 0.171 5.81 4.00 Prom - 137166 137127 40 -8.15 5.00 Prom + 137266 137305 40 -6.45 5.01 Sngl + 143838 144320 483 2 0 68 36 429 0.736 31.52 5.02 PlyA + 145506 145511 6 1.05 6.00 Prom + 146648 146687 40 -3.55 6.01 Init + 161041 161154 114 0 0 84 60 93 0.615 6.36 6.02 Intr + 161878 162105 228 0 0 33 101 128 0.667 5.74 6.03 Intr + 171697 171817 121 0 1 52 84 123 0.530 7.45 6.04 Intr + 173143 173272 130 2 1 65 100 127 0.989 10.43 6.05 Intr + 174812 174896 85 2 1 77 105 37 0.986 3.20 6.06 Intr + 180375 180564 190 2 1 43 80 138 0.965 6.74 6.07 Intr + 186116 186236 121 0 1 80 111 59 0.958 6.03 6.08 Intr + 189057 189200 144 0 0 48 52 148 0.969 5.68 6.09 Intr + 190643 190876 234 2 0 85 82 96 0.796 4.58 6.10 Intr + 193472 193634 163 2 1 86 72 28 0.134 0.06 6.11 Intr + 201210 201579 370 2 1 47 83 261 0.059 15.25 6.12 Intr + 205248 205276 29 1 2 107 103 -21 0.088 -1.78 6.13 Term + 225804 226004 201 2 0 60 43 146 0.969 3.71 6.14 PlyA + 227801 227806 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 201414 201579 166 2 1 97 83 232 0.876 23.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:202780863_203011476|GENSCAN_predicted_peptide_1|116_aa MTSPNKLNSVLVTNPEVTEICDLSEREFKIAVLRKLSEIQDNTEKEFRILSGKLNKEVKI TFKNQAEIPELKNSIDIVKNASESTGVLIKQKKESTSRHLGKIEKNLREISGVNFA >gi568815596r:202780863_203011476|GENSCAN_predicted_CDS_1|351_bp atgacctcaccaaacaaactaaatagtgtactagtgaccaatcctgaagtgacagagata tgtgacctttcagagagagaattcaaaatagctgttttgaggaagctcagcgaaattcaa gataacacagagaaggaattcagaatcctatcaggtaaacttaacaaagaggttaaaata acttttaaaaatcaagcagaaattccagagttgaaaaattcaattgatatagtgaagaac gcatcagagtcaacaggagtattgatcaagcagaagaaagaatcaaccagccgacacttg gggaaaatagaaaagaacctacgtgaaatatcgggggtgaatttcgcctga >gi568815596r:202780863_203011476|GENSCAN_predicted_peptide_2|289_aa MDSFGQPRPEDNQSVVRRMQKKYWKTKQVFIKATGKKEDEHLVASDAELDAKLEVFHSVQ ETCTELLKIIEKYQLRLNVISEEENELGLFLKFQAERDATQAGKMMDATGKALCSSAKQR LALCTPLSRLKQEVATFSQRAVSDTLMTINRMEQARTEYRGALLWMKDVSQELDPDTLKQ MEKFRKVQMQVRNSKASFDKLKMDVCQKVDLLGASRCNMLSHSLTTYQRTLLGFWKKTAR MMSQIHEACIGFHPYDFVALKQLQDTPSKISEDNKDEQIGGFLTEQLNK >gi568815596r:202780863_203011476|GENSCAN_predicted_CDS_2|870_bp atggattcctttgggcaacccagaccagaagataatcagtcagtagtcagaagaatgcaa aagaaatactggaaaactaaacaggtctttatcaaagcaacaggaaaaaaagaggatgag cacttggtggcgtctgatgctgaactggatgctaaacttgaggtttttcactctgttcaa gagacatgcactgaacttctgaagataatcgagaaataccagctaagactcaatgttata tcagaggaagaaaatgagctagggctctttttaaaatttcaagcagaacgggatgcaact caagctggcaaaatgatggatgccactggcaaggcactttgttcttcagccaagcaaaga ttggccctgtgtactcctctgtctcgtctgaagcaagaagtagcaacattcagtcaaagg gcagtatctgataccttgatgacaattaatcggatggagcaggcacgcacagaatacaga ggagctctactgtggatgaaagatgtatcccaagagctggacccagacaccttaaagcaa atggaaaagtttagaaaagtacagatgcaagtgagaaatagcaaagcttcttttgacaag ttaaagatggatgtttgtcagaaagtggatttacttggagctagtcgctgcaatatgcta tctcattcgctcactacctaccagagaacactgcttggattctggaagaaaacagctcga atgatgtcccaaattcatgaagcctgtattggctttcatccgtatgattttgtagctctc aagcaactacaagacacgccaagcaagattagtgaagacaataaagatgaacaaataggc ggttttcttactgaacagctcaataagtaa >gi568815596r:202780863_203011476|GENSCAN_predicted_peptide_3|495_aa MLRSRVGPVAGSGRRPRGANADASGSLTPPRTGAEGHWLPEGRESARGSRAVAGTGAGTD HRTAERDPRRRKSGAGPSSAGLLQFAGGLLYNLFAWVSPAEATEQQRLLPVPSSESFVPE GHQPDASWNSPRRSGLSGKFGKQKLGKFISIPSFFSHTGKPLVIQLPLTGLLNLILVLGY EDYLALTPTCLHSCINHVHHGDPEVLQGVHQRWSSGPQAFGNCSYARGPGACISSLSFSR VGSSIFRGVLGGSYGGASGMGSITAVTVNQSLLSPLNLEVEPNIQAVCTQEKDPQQQICL LHRQGTIPGAAEQDAGDQSYINSLKQQLETLGLERLKLEAELGNMPGLVEDFKNKYEDEI NKRTEMENEFVLIRQYVDEAYMNKVELESCLEGLTEEINLLGQLYEEEIQKLQSQILDTS VVLCMNNSHSLDMDSIIAEVKAQYEEMANHSQAEAESMYQIKYEELQTLAGKHGDDLRHA KTEIWDEPERQPAPG >gi568815596r:202780863_203011476|GENSCAN_predicted_CDS_3|1488_bp atgctccgatcccgagtcgggccggtggcggggtcaggccggcgcccgaggggggcgaac gcggacgccagcgggagcctgacacctccacgcactggtgcggaggggcactggctgccg gagggccgggagtcggcgcggggctcgcgggcggtggccggaacgggtgccgggacggat caccgtacggccgaacgcgacccgaggaggcggaagagcggcgccggcccctcttctgca ggtctgctacagtttgctggaggtctactctataacctgtttgcctgggtatcaccagca gaggctacagaacagcaacgattgctgcctgttccttcctctgaaagctttgttccagag gggcaccagcctgatgccagctggaactctcctaggaggagtggcttgtctggtaaattt gggaagcaaaaattgggcaagtttatctcaattccttctttcttctctcatacagggaag cctctggtgattcagttacctctgactggcttgttgaatttaattttagtgctcggttat gaagattatttggctctcacacccacctgcctccactcctgcatcaaccatgtccatcat ggtgacccagaagtcctacaaggtgtccaccagaggtggtcctctggcccccaggccttc ggcaactgctcctatgcgcgtgggcccggtgcctgcatcagctccttgagcttctcccga gtgggcagcagcatctttcggggtgtcctgggtggaagttatggtggggccagtggcatg ggaagcatcactgctgtcacagtcaaccagagcctgctgagcccccttaacctggaggtg gaacccaacatccaggccgtgtgcacccaggagaaggaccctcaacaacaaatttgcctc ctccatagacaaggtacaattcctggagcagcagaacaagatgctggagaccaaagctac atcaacagccttaagcagcagctggagactctgggcttggaaaggctgaagctggaggca gagcttggcaacatgccagggctggtggaagacttcaagaacaaatatgaggatgagatc aacaaacgtacagagatggagaatgaatttgtcctcatcaggcagtatgtggatgaagct tatatgaacaaggtagagctagagtcttgcctggaagggctgactgaagagatcaacctc ctcgggcagctgtatgaagaggagatccagaagctgcagtcccagatcttggacacatct gtggtgctgtgcatgaacaacagccactccctggacatggacagcatcatcgctgaggtc aaggcgcagtatgaggagatggccaaccacagccaggctgaggctgagagcatgtaccag atcaagtatgaggaactgcagacgctggctgggaagcacggggatgacctgcggcatgca aagactgagatctgggatgaaccagaacgtcagccagctccaggctga >gi568815596r:202780863_203011476|GENSCAN_predicted_peptide_4|436_aa MWDPEKAPKLGAEGAASDRLRLGAEAGSCSELRRAGASCCGSANPYVSVGKSCVLLAMAQ LQTRFYTDNKKYAVDDVPFSIPAASEIADLSNIINKLLKDKNEFHKHVEFDFLIKGQFLR MPLDKHMEMENISSEEVVEIEYVEKYTAPQPEQCMFHDDWISSIKGAEEWILTGSYDKTS RIWSLEGKSIMTIVGHTDVVKDVAWVKKDSLSCLLLSASMDQTILLWEWNVERNKVKALH CCRGHAGSVDSIAVDGSGTKTPIVTLSGHMEAVSSVLWSDAEEICSASWDHTIRVWDVES GSLKSTLTGNKVFNCISYSPLCKRLASGSTDRHIRLWDPRTKDGSLVSLSLTSHTGWVTS VKWSPTHEQQLISGSLDNIVKLWDTRSCKAPLYDLAAHEDKVLSVDWTDTGLLLSGGADN KLYSYRYSPTTSHVGA >gi568815596r:202780863_203011476|GENSCAN_predicted_CDS_4|1311_bp atgtgggaccccgaaaaggcccccaagcttggcgcggaaggcgccgccagtgaccggctg cggctgggggcggaggccggcagttgctcggagctccggcgggcaggagcttcgtgttgt gggtctgctaacccgtacgtttccgtgggcaagtcgtgtgtactcctcgccatggctcag ctccaaacacgcttctacactgataacaagaaatatgccgtagatgatgttcccttctca atccctgctgcctctgaaattgccgaccttagtaacatcatcaataaactactaaaggac aaaaatgagttccacaaacatgtggagtttgatttccttattaagggccagtttctgcga atgcccttggacaaacacatggaaatggagaacatctcatcagaagaagttgtggaaata gaatacgtggagaagtatactgcaccccagccagagcaatgcatgttccatgatgactgg atcagttcaattaaaggggcagaggaatggatcttgactggttcttatgataagacttct cggatctggtccttggaaggaaagtcaataatgacaattgtgggacatacggatgttgta aaagatgtggcctgggtgaaaaaagatagtttgtcctgcttattattgagtgcttctatg gatcagactattctcttatgggagtggaatgtagagagaaacaaagtgaaagccctacac tgctgtagaggtcatgctggaagtgtagattctatagctgttgatggctcaggaactaaa actcccatagtgaccctctctggccacatggaggcagtttcctcagttctgtggtcagat gctgaagaaatctgcagtgcatcttgggaccatacaattagagtgtgggatgttgagtct ggcagtcttaagtcaactttgacaggaaataaagtgtttaattgtatttcctattctcca ctttgtaaacgtttagcatctggaagcacagataggcatatcagactgtgggatccccga actaaagatggttctttggtgtcgctgtccctaacgtcacatactggttgggtgacatca gtaaaatggtctcctacccatgaacagcagctgatttcaggatctttagataacattgtt aagctgtgggatacaagaagttgtaaggctcctctctatgatctggctgctcatgaagac aaagttctgagtgtagactggacagacacagggctacttctgagtggaggagcagacaat aaattgtattcctacagatattcacctaccacttcccatgttggggcatga >gi568815596r:202780863_203011476|GENSCAN_predicted_peptide_5|160_aa MGQMSALTPRTSPGWATAAASGVAWTLAWVWREAMPGLVVWRSITAATVTQSLPSPLKLD VHTNTQAVCTPEKLQIKTLNKFAPFTEKVPFLEQQNKILETKWILLQQQKVAWSNMDSMF ESYINNLKQQLDTLSQEQLKLEAELGNMERPVEDYQKCEV >gi568815596r:202780863_203011476|GENSCAN_predicted_CDS_5|483_bp atgggccagatgtctgcattaactcctagaacttctcctggatgggcaacagcagcagct tccggggtagcctggacactggcatgggtctggagggaggctatgccaggtctggtggta tggaggagcatcacagctgccacagtgacccagagcctcccgagtccccttaagctggat gtgcacaccaacacccaggctgtatgcaccccggagaaattgcagatcaagaccctaaac aagtttgcccccttcactgagaaggtacccttcctggagcagcagaacaagatactggag accaagtggatccttttgcagcagcaaaaagtagcttggagcaacatggacagcatgttt gagagctacatcaacaacctaaagcagcaactggacacactgagccaggagcagctgaag ctggaggcagaacttggcaacatggagagaccagtggaggactatcaaaagtgtgaagtt taa >gi568815596r:202780863_203011476|GENSCAN_predicted_peptide_6|709_aa MEQSNDSLRVNHNDGEESKTSAQVFEVWIQLGTFFHPRHLICMDSRDSSFGQNDSPTVLP ITTREANNSLISQNIPGPLTQTQTLSAEQFHLVDQNGQAIQYELQSLGESNAQMMIVASP TENGQVLRVIPPTQTGMAQVIIPQGQLVDVNSPRDVPEEKPSNRNLPTVRVDTLADNTSN YILHPQTSFPLPKKSVTGMLEEPLLGPLQPLSSNTPIWACRLRSCEKIGDSYRGYCVSET ELESVLTFHKQQTQSVWGTRQSPSPAKPATRLMWKSQYVPYDGIPFVNAGSRAVVMECQY GPRRKGFQLKKVSEQESRSCQLYKATCPARIYIKKVQKFPEYRVPTDPKIDKKIIRMEQE KAFNMLKKNLVDAGGVLRWYVQLPTQQAHQYHELETPCLTLSPSPFPVSSLEEEETAVRD ENCALPSRLHPQVAHKIQELVSQGIEQVYAVRKQLRKFVERELFKPDEVPERHNLSFFPT VNDIKNHIHEVQKSLRNGDTVYNSEIIPATGLQLQPRYTSPDESPAVVSVNNQPSSSPSG LLDTIGSAVMNNNSLLLGQSHSLQRDTCLTQNNSTASTMGNLPEPDQNLVAMDELVEVGD VEDTGNLEGTVHRILLGDVQTIPIQIIDNHSALSRNWSYQYFHISKYVRQKLIELQGEMD ESTIIVGDLSTPLLYKRTDPSSRHKDIVELNSNIDICRLLIQQQQNTHS >gi568815596r:202780863_203011476|GENSCAN_predicted_CDS_6|2130_bp atggaacaatctaatgattcattaagagtcaaccataatgacggtgaagagtcaaaaacc agtgctcaagtatttgaggtatggattcagctgggaacctttttccatcctaggcatcta atctgtatggactccagggattcttcctttggacaaaatgattctcctacagttttgccc atcactactcgtgaagcaaataattcactcatatcacagaatataccagggcccctgact cagacacagactctttctgcagagcaattccatctagtggaccaaaatgggcaggctatt caatatgaacttcagtcattgggggaatccaatgcacaaatgatgatcgttgccagccca acagaaaatggacaggtacttcgtgtaattccacctacccagacaggaatggcacaagtg attatacctcaggggcaacttgtggatgtgaatagtcctcgggatgtccctgaagagaaa cccagtaacagaaacttaccaactgtaagagtggatactctagcagacaataccagcaat tacattcttcatcctcaaacatccttcccattgcccaaaaagtcagtgaccggaatgctg gaagaaccccttctggggcctcttcagccactttcttctaatacacctatatgggcctgc cgtcttaggagctgtgagaaaattggagattcataccgtggctactgtgtaagtgagact gaattagaaagtgtcctaacatttcacaagcagcaaacacagagtgtttgggggacccgt cagtctccaagcccagccaagcctgctacacgcttgatgtggaaatcccagtatgttcca tatgatggaatcccatttgttaatgcagggagtagagctgtggtaatggagtgtcagtat gggccaagaagaaaaggtttccagttaaaaaaagtcagtgagcaggaaagcaggtcttgt cagctctacaaagccacttgtccagctcggatttacattaaaaaggtacagaagtttcct gaatatagagttcctacagaccccaaaattgacaagaaaattatcagaatggagcaggag aaagcttttaacatgctaaagaagaacttggtagatgctggtggtgttcttaggtggtat gtacagttacctacacagcaagctcatcagtatcatgaattagagactccctgcctcact ttgtcaccttctccttttcctgtgtcttctcttgaagaagaggaaactgcagttagagat gagaattgtgcattaccctcacgtttacatcctcaagtagcacataagattcaagaatta gtatcacagggaatagaacaagtgtatgcagtaaggaaacagctaagaaaatttgtggaa agggaactgttcaaacccgatgaggtacctgaaagacataatttatctttttttccaact gtaaatgatataaaaaatcacatccatgaggtacagaaatccttgagaaatggagatacg gtatataactcagagattattccagcaacgggtttgcagttacaaccaaggtacacctct cctgatgaatcaccagctgtggtatcagtaaataaccagccgtcctctagtccttcagga cttctggatacaataggaagtgctgtaatgaataataattctctactgcttggtcaaagt catagccttcaaagagatacatgcttaacccaaaacaatagtactgcctccaccatgggt aaccttccagaaccagatcaaaatctagttgcaatggacgagctggtagaagttggagat gttgaggatacagggaatctggaaggaactgttcatcggattctgttgggagatgtgcag actattccaatacagattatagacaaccactcagctcttagtagaaactggagctatcaa tattttcatatatcaaaatatgtgaggcaaaaactgatagaactccaaggagaaatggat gaatccactattatagttggagacttaagtacccctctattatataaacggacagatcca tcaagcagacataaggacatagttgaattgaacagtaacatcgacatctgtcgactactt atccaacaacagcagaatacacattcttaa