GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:01:10 Sequence gi568815596f:68545822_68755573 : 209752 bp : 39.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 9705 9744 40 -3.85 1.01 Init + 10119 10293 175 2 1 71 80 116 0.715 8.66 1.02 Intr + 31999 32182 184 2 1 90 10 343 0.041 24.62 1.03 Intr + 43399 43474 76 1 1 60 27 81 0.011 -2.20 1.04 Intr + 45466 45744 279 0 0 23 53 175 0.035 4.25 1.05 Intr + 49568 49604 37 1 1 56 80 39 0.057 -3.18 1.06 Intr + 51124 51177 54 2 0 116 74 62 0.119 5.63 1.07 Intr + 53276 53722 447 0 0 67 86 195 0.172 9.59 1.08 Term + 53778 54268 491 1 2 75 45 149 0.990 3.13 1.09 PlyA + 55806 55811 6 1.05 2.03 PlyA - 58776 58771 6 1.05 2.02 Term - 60686 60623 64 1 1 140 38 30 0.713 -0.42 2.01 Init - 61412 61267 146 2 2 73 68 88 0.780 4.94 2.00 Prom - 68313 68274 40 -3.65 3.03 PlyA - 69077 69072 6 1.05 3.02 Term - 70121 69219 903 2 0 -16 41 337 0.521 9.93 3.01 Init - 70590 70411 180 0 0 70 41 181 0.521 10.83 3.00 Prom - 71540 71501 40 -6.15 4.02 PlyA - 71709 71704 6 1.05 4.01 Sngl - 72953 72624 330 2 0 88 44 285 0.988 19.87 4.00 Prom - 76274 76235 40 -7.05 5.00 Prom + 77332 77371 40 -6.85 5.01 Init + 81289 81513 225 0 0 49 20 189 0.224 6.72 5.02 Intr + 89512 89704 193 0 1 82 88 38 0.045 1.34 5.03 Intr + 89947 90102 156 2 0 42 76 93 0.046 2.56 5.04 Intr + 97262 97353 92 0 2 104 47 83 0.027 4.59 5.05 Term + 97427 97987 561 1 0 -4 48 234 0.026 3.58 5.06 PlyA + 98325 98330 6 1.05 6.00 Prom + 99509 99548 40 -9.35 6.01 Init + 100013 100485 473 1 2 110 109 670 0.788 66.34 6.02 Intr + 109059 109675 617 0 2 107 115 717 0.459 67.36 6.03 Intr + 110542 110645 104 2 2 68 74 104 0.510 5.97 6.04 Intr + 110815 110861 47 0 2 86 91 63 0.450 2.69 6.05 Term + 121926 122049 124 0 1 63 41 69 0.025 -3.42 6.06 PlyA + 122243 122248 6 1.05 7.00 Prom + 126012 126051 40 -4.25 7.01 Init + 134066 134274 209 1 2 68 116 154 0.217 12.54 7.02 Intr + 146851 146932 82 1 1 90 77 45 0.035 2.32 7.03 Intr + 162956 163136 181 1 1 75 55 135 0.003 7.42 7.04 Intr + 163875 163927 53 1 2 52 89 6 0.013 -5.19 7.05 Intr + 164731 164877 147 0 0 83 109 32 0.009 4.31 7.06 Term + 175128 175262 135 2 0 52 38 129 0.001 1.34 7.07 PlyA + 175300 175305 6 1.05 8.04 PlyA - 175452 175447 6 1.05 8.03 Term - 177793 177558 236 2 2 35 49 205 0.274 6.80 8.02 Intr - 187011 186896 116 2 2 83 42 107 0.051 4.77 8.01 Init - 194706 194591 116 0 2 69 110 52 0.700 5.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 31999 32201 203 2 2 90 39 339 0.953 25.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_1|580_aa MEPTQMRINKRVDKQCKESMMEYYSTVRRNELMALTVTWMRLETIILSEVSQEWKTKHVR NVLDEDNDNVGQPNEYDLNDSFLDDEEEDYEPTDEDSDWEPGKEDEEKEDVEELLKEAKR RNRAEDKVGGNFAEEKPQAELSEHGHTKETENAVNFSLYVTGNEYMRPPDSASEPKSDLI LNGGDVEKRVQPGPTEGQITDNFHQSSPKEGADSTLAKNVNLNLIKPLNLTPRIHVKMTG WCSSKELPDLEHNIVNGENTLSIDIEVPMLEVLARAIRQEKEIKGIQLGKEEVKFSLFAD DMILYLENPIVSTQNLLKLISNFSKVSGYKINVQESQAFLYTNNRKTESQIMSELPFTIA SKRIKYLGIQLTRDVKDLFKENYKPLHNKIKGHKQMEEHSMLMDRKNQYRENGHTAQELE KTTLKFIWNQQRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYSYQNIDIDQWNR TEPSEIIPHIYNYLIFDKSDKNKKWEKDSLFNKWCWEKLDPFLTPYTKINSRWIKDLNDR PKTIKTLEENLGNTFQDIGMGKNFMSKTPKAMAMKAKIDK >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_1|1743_bp atggaaccaacccaaatgcgcatcaataaacgagtggataaacagtgcaaagaaagtatg atggaatactactcaactgtaagaaggaatgaattaatggcactcacagtgacctggatg agattggagactattattctaagtgaagtatctcaggaatggaaaaccaaacatgtgaga aatgttttagatgaagataatgataatgttgggcaacccaatgagtatgacctgaacgac agctttctagatgatgaggaagaagactatgagccaacagatgaagattctgactgggaa ccaggaaaggaagatgaagagaaggaagatgtggaagagcttttgaaagaagcaaaaagg agaaacagggctgaagacaaggttggaggtaactttgcagaggagaagccacaggcagaa ctctcagaacatgggcataccaaggagacagagaatgctgtgaacttcagtttatatgta acaggcaatgaatacatgaggccaccagactcagcttctgagcccaagtcagacctgata ctcaatggaggtgacgtggaaaagagagtgcagccaggtcccaccgaggggcagatcaca gataacttccatcagagttcgcccaaagaaggagctgactccacccttgccaaaaatgtt aacctgaatctaattaagcctttaaacctaactcccaggatacatgtaaaaatgacaggg tggtgtagcagcaaagagctcccagacctggaacataatatcgttaatggagaaaatact ctcagtatagatattgaagtacccatgttggaagttctggccagggcaatcaggcaggag aaagaaataaagggtattcaattaggaaaagaggaagtcaaattttccctgtttgcagat gacatgattctatatctagaaaaccccattgtctcaacccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaagatcaatgtgcaagaatcacaagcattttta tatacaaataacagaaaaacagagagccaaataatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgcacaacaaaataaaaggacacaaacaaatggaagaacattcc atgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaagaattggaa aaaacaactttaaagttcatatggaaccaacaaagagcctgcattgccaagtcaatccta agccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggct acagtaaccaaaacagcatggtactcttaccaaaacatagatatagaccaatggaacaga acagagccctcagaaataataccacacatctacaactatctgatctttgacaaatctgac aaaaacaagaaatgggaaaaggattccctgtttaacaaatggtgctgggaaaaactggat cccttccttacaccttatacaaaaataaattcaagatggattaaagacttaaatgataga cctaaaaccataaaaaccctagaagaaaacctaggcaatacctttcaggacataggcatg ggcaagaacttcatgtctaaaacaccaaaagcaatggcaatgaaagccaaaattgacaaa tag >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_2|69_aa MDPNHGPGSPSTSHAEKLLEHQGFTCRCLTDQCFLTELLSTLNTTDPNRTWCLVLSTLHK CRSKFYAVI >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_2|210_bp atggatcccaaccatggaccaggttcccccagtacaagccatgctgagaaactgctggag caccagggttttacctgtagatgcttaacggaccaatgctttctgactgaactcctctct accctgaatacaacagaccctaataggacctggtgtttggtcttatcaacgctacacaaa tgtaggtccaaattctatgctgtcatttag >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_3|360_aa MDKFLDTYTLPRLNQEEVASLNRPITGAEIVAIINSLPTKKSPGPDGFTAKFYQRYKEEL HINRITDKNHMIISIDAEKAFDKIQQRFMLKTLNKLGIDGTYLKIVRAIYDKPTANIILN GQKLEAFPLKTGTRWRCPLSPLLFNIVLEVLARPIRQEKEIKGIQLGKEEVKLSLFADDM TVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQALLYTNNKQTESQIMSELPFTIASK RIKCLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINLMKMAILPKEIYR FNATPIKLPMTFFTELEKTTSKFIWNQKRACITKSILSQKNKAGGITLPDFKLYYKATVT >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_3|1083_bp atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgcatct ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccaaattctaccagaggtacaaggaggaactg catataaacagaatcacagacaaaaaccacatgattatctcaatagatgcagaaaaggcc tttgacaaaattcaacaacgcttcatgctaaaaactctcaataaattaggtattgatggg acatatctcaaaatagtaagagctatctatgacaaacccacagccaatatcatactgaat gggcaaaaactggaagcattccctttgaaaactggcacaagatggagatgccctctctca ccactcctattcaacatagtgttggaagttctggccaggccaattaggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatg actgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattattatacacc aacaacaaacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatgcctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaac tacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatctcatgaaaatggccatactgcccaaggaaatttacaga ttcaatgccacccccatcaagctaccaatgactttcttcacagaattggaaaaaactact tcaaagttcatatggaaccaaaaaagagcctgcatcaccaagtcaatcctaagccaaaag aataaagctggaggcatcacactacctgacttcaaactatactacaaggctacagtaacc taa >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_4|109_aa MGKKQSRKTGNSKKQSTSPPPKEHSSSPATEQSWRENDFDELREEGFRRSNYSELREEIQ TKGKEVENFEKSLEECLTRITNTEKCLKELMELKTKAGELHEECRSLRS >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_4|330_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct ccaaaggaacacagttcctcaccagcaacggaacaaagctggagggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggaaattcaa accaaaggcaaagaagttgaaaactttgaaaaaagtttagaagaatgtttaactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctggagaacta catgaagaatgcagaagcctcaggagctga >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_5|408_aa MTHSALEYFNVIGQGLKHLFWQQPKKSSMSPCDVQKIQADPEPEIDLESQNTCAETESSP TSHPTALNQFLQQIQSPTAHDPATTGLSQEDQNLRQTPCVGSVSDLHLSPQLSPYKILQN GDSTELSDFTPRHIRILTPGRRGVCLYLSESSFIPPPWGVGKWRGPAESGPPDQLSGLWE TSQAYWLARCRESTSLSLFYGEVGQLATKMALVSVQAGPGDQLAAERGRVRPLRRRWAAK VSRRGAPCPEAGTAALHGCPRRGIDAEPCPEPQRPRSCPRRPFRSLSRATRSPPGPAPSR ASSSRAAAWELQTLPPRAIRTLPPRAIPASAAQLATRLAKSPHLGASRPPEAGTGSPARA GSGDLQRRREGNSPARAGKEPRAEEEGSGDLPFACRLLEEASPDPAPE >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_5|1227_bp atgacccatagtgctctggagtactttaatgtgattggccaaggcttgaagcatctcttc tggcagcagcccaagaagtcatccatgtctccatgtgatgtgcagaaaattcaggcagat ccagaacctgaaatagatctggaaagccagaacacatgtgctgagactgagagtagcccc acctcccaccccacagctctgaatcagttcctgcaacaaatccaaagtcccactgcccat gaccctgccaccactggacttagtcaggaagaccagaatctgaggcagacaccatgtgtg ggctctgtctctgaccttcacctgtctcctcaactgtctccatataaaatactccagaat ggcgattccactgagctatcagacttcacgcccaggcatatccggatcttgacaccagga aggaggggcgtgtgtctgtacctatctgagtcttccttcatacctcctccttggggtgtg gggaagtggagaggaccagcagagagcggcccaccggatcagctgagtgggctttgggag accagtcaagcttactggctcgcacgctgcagagaaagcacttctctgagcctcttttac ggagaggtagggcagctggcaacaaagatggccctggtgtccgttcaggctggcccaggg gaccagctcgcggctgagcggggccgtgtccgtccactccgacggcgctgggctgccaag gtgagccgccgaggagccccgtgccccgaggcgggcacagccgccctgcatgggtgcccc cggcgcgggattgacgcagagccctgcccggagccccagcgaccccgttcctgccccagg cggccgttccgaagcctgagccgggcaacgcgcagcccgcctggcccggctccctcccgc gcgtcctcgtcgcgagctgccgcctgggagctccagacgctcccgcccagggcgatccgg acgctcccgcccagggcgatcccggcctctgcagcccagctggcgacacgcctcgccaag agcccgcacctcggcgcctcacggccaccggaggcggggacagggagtccggcgcgggcc gggtcgggggatctgcagcggcggcgcgaaggtaactcccccgccagagccgggaaagag ccccgagcagaggaggaagggagcggggacctgcccttcgcctgccggcttctggaagaa gcgtctcctgatccggccccggaataa >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_6|454_aa MGFMDDNATNTSTSFLSVLNPHGAHATSFPFNFSYSDYDMPLDEDEDVTNSRTFFAAKIV IGMALVGIMLVCGIGNFIFIAALVRYKKLRNLTNLLIANLAISDFLVAIVCCPFEMDYYV VRQLSWEHGHVLCTSVNYLRTVSLYVSTNALLAIAIDRYLAIVHPLRPRMKCQTATGLIA LVWTVSILIAIPSAYFTTETVLVIVKSQEKIFCGQIWPVDQQLYYKSYFLFIFGIEFVGP VVTMTLCYARISRELWFKAVPGFQTEQIRKRLRCRRKTVLVLMCILTAYVLCWAPFYGFT IVRDFFPTVFVKEKHYLTAFYIVECIAMSNSMINTLCFVTVKNDTVKYFKKIMLLHWKAS YNGVRREMQVEETQNGKAKLEACRAQCSSGVAGAQALQLSNVSDEEFPPNQTAWHGGPES RKDNGPMVTCGVHANTGLIISPSPSEIPISFHLV >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_6|1365_bp atggggttcatggatgacaatgccaccaacacttccaccagcttcctttctgtgctcaac cctcatggagcccatgccacttccttcccattcaacttcagctacagcgactatgatatg cctttggatgaagatgaggatgtgaccaattccaggacgttctttgctgccaagattgtc attgggatggccctggtgggcatcatgctggtctgcggcattggaaacttcatctttatc gctgccctggtccgctacaagaaactgcgcaacctcaccaacctgctcatcgccaacctg gccatctctgacttcctggtggccattgtctgctgcccctttgagatggactactatgtg gtgcgccagctctcctgggagcacggccacgtcctgtgcacctctgtcaactacctgcgc actgtctctctctatgtctccaccaatgccctgctggccatcgccattgacaggtatctg gctattgtccatccgctgagaccacggatgaagtgccaaacagccactggcctgattgcc ttggtgtggacggtgtccatcctgatcgccatcccttccgcctacttcaccaccgagacg gtcctcgtcattgtcaagagccaggaaaagatcttctgcggccagatctggcctgtggac cagcagctctactacaagtcctacttcctctttatctttggcatagaattcgtgggcccc gtggtcaccatgaccctgtgctatgccaggatctcccgggagctctggttcaaggcggtc cctggattccagacagagcagatccgcaagaggctgcgctgccgcaggaagacggtcctg gtgctcatgtgcatcctcaccgcctacgtgctatgctgggcgcccttctacggcttcacc atcgtgcgcgacttcttccccaccgtgtttgtgaaggagaagcactacctcactgccttc tacatcgtcgagtgcatcgccatgagcaacagcatgatcaacactctgtgcttcgtgacc gtcaagaacgacaccgtcaagtacttcaaaaagatcatgttgctccactggaaggcttct tacaatggcgtgagaagggagatgcaagtggaggagactcagaacgggaaggcaaaactg gaagcatgcagggctcaatgcagctcaggtgtggcaggggctcaggctctccagctctca aatgtgagtgatgaagagttccctcctaaccagacagcatggcatggtggtccagagagc agaaaagataatgggcccatggtaacctgtggtgtccatgccaacactggactgattatc tccccttctcctagtgagataccaataagcttccacctggtttaa >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_7|268_aa MPRPALARAQCAHPLTCAHCLALPSEMNPVPQMEMQKSPIFCVAHAGSCRPELFLFGHLG SSSIFVYSERELGELTIVAAPSSCSNTSLFAADSASEHCALASLPSCTMNGCEQVGSCWE RLRPVFPVDVDRTFLPAHIWQEDGPRCVQSSDAILSLGYDLEHIKPRSKHPVIWQISTSL GALKVVPAKGHTQLLVYSDFQPKWLWTLLEPPAPSPSINMSLQQMWLNAETLKAPPDSKA LLWHNILIAFSLLMSLELHMKLLGGNFN >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_7|807_bp atgcctcgccctgctttggctcgcgcacagtgcgcgcacccactgacctgcgcccactgt ctggcactccctagtgagatgaacccggtacctcagatggaaatgcagaaatcacccatc ttctgcgtcgctcacgctggtagctgtagaccggagctgttcctattcggccatcttggc tcctcctcaatttttgtatatagtgaaagggagctgggtgaattgactattgtagctgct ccttcctcctgcagtaacacttccctttttgctgctgacagtgcttcagagcactgtgcc ctggcttccctgccatcatgcacaatgaatggctgtgaacaggttgggagttgttgggaa aggctaagacctgtcttcccagttgatgtggaccggactttcctaccagctcacatttgg caagaagatggccctcgatgtgtacaaagctcagatgcaattttgtcactgggttatgat ctggaacatattaaaccaagatccaaacaccccgtaatttggcaaataagcaccagccta ggagctctcaaagttgttccagctaaaggacacactcagcttttggtctacagtgacttc cagcctaagtggctctggacactgctggaaccccccgctccatctccctccataaatatg tctctgcagcagatgtggctgaacgcagaaacactcaaagcacccccagactccaaagcc ttactgtggcataatatcctcattgctttttccctccttatgtccctggagctacacatg aaacttcttgggggcaacttcaactaa >gi568815596f:68545822_68755573|GENSCAN_predicted_peptide_8|155_aa MERGLQADEKELEATEGRMWLKSPVCSRRREGGKREVGRNERSTREKNLASAFNASAHTL SAHIPLANTGHMAKLKVTSSVGLCRTEKTASEKSIWNEEAQEASGCRSHNIIQQTIEACR TQHSHETATKDWEAQQGIYVTQRWTAYQLSGMWFE >gi568815596f:68545822_68755573|GENSCAN_predicted_CDS_8|468_bp atggagagagggctccaggcagatgagaaagagctggaagctacagaagggaggatgtgg ctgaagtctccagtgtgcagcaggagaagggaaggtggcaagagagaggtgggcaggaat gaaaggagcacaagagagaagaaccttgcaagcgcatttaacgcctctgcgcacaccctg tctgctcacattccattggccaacacaggtcacatggccaagctcaaagtcactagcagt gtgggactgtgccgcacagaaaagacagcttctgagaaaagcatttggaatgaagaagcc caagaagcctctggatgcaggagccataatattatacaacagaccatagaggcctgcagg actcagcacagtcatgaaacagccaccaaggactgggaagcccaacagggaatctatgtg acacagagatggactgcttaccagctaagcggcatgtggtttgagtga