GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:57:36 Sequence gi568815594f:9682030_9883460 : 201431 bp : 46.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9529 9566 38 0 2 106 97 78 0.876 7.99 1.02 Intr + 14619 14763 145 0 1 47 64 143 0.301 8.08 1.03 Intr + 15992 16122 131 1 2 101 37 140 0.923 9.69 1.04 Intr + 16192 16454 263 1 2 92 95 288 0.923 27.03 1.05 Intr + 17316 17465 150 1 0 37 71 196 0.648 12.93 1.06 Term + 19328 19350 23 0 2 44 34 51 0.195 -6.23 1.07 PlyA + 19894 19899 6 1.05 2.10 PlyA - 20222 20217 6 -0.45 2.09 Term - 21849 21601 249 0 0 123 49 172 0.994 12.50 2.08 Intr - 22930 22855 76 0 1 97 68 117 0.947 10.12 2.07 Intr - 24062 23945 118 0 1 65 46 134 0.942 6.42 2.06 Intr - 25720 25610 111 2 0 38 66 101 0.747 3.25 2.05 Intr - 27950 27808 143 1 2 91 46 82 0.877 4.30 2.04 Intr - 28780 28670 111 0 0 82 82 115 0.992 9.79 2.03 Intr - 44361 44265 97 1 1 87 44 107 0.048 5.27 2.02 Intr - 45553 45422 132 2 0 74 87 31 0.511 2.22 2.01 Init - 48368 48344 25 2 1 59 80 52 0.231 0.04 2.00 Prom - 51562 51523 40 -2.86 3.00 Prom + 52360 52399 40 -6.86 3.01 Init + 53805 54040 236 2 2 71 88 142 0.544 10.11 3.02 Term + 72825 73209 385 0 1 92 47 333 0.734 23.96 3.03 PlyA + 75275 75280 6 1.05 4.00 Prom + 75774 75813 40 -7.76 4.01 Init + 79509 80331 823 2 1 65 38 304 0.664 17.98 4.02 Intr + 81489 81548 60 1 0 86 110 14 0.541 2.11 4.03 Intr + 87362 87404 43 0 1 59 64 25 0.001 -5.50 4.04 Intr + 99682 99868 187 1 1 67 89 168 0.919 14.49 4.05 Term + 99977 101434 1458 1 0 -2 37 2623 0.747 238.91 4.06 PlyA + 101620 101625 6 1.05 5.00 Prom + 102729 102768 40 -6.16 5.01 Init + 106316 106323 8 1 2 114 91 0 0.118 3.40 5.02 Intr + 113773 113945 173 1 2 67 55 36 0.016 -2.21 5.03 Intr + 135112 135217 106 2 1 94 95 63 0.912 6.87 5.04 Term + 136097 136277 181 2 1 76 33 124 0.942 2.78 5.05 PlyA + 138339 138344 6 1.05 6.00 Prom + 147360 147399 40 -1.16 6.01 Init + 148752 148821 70 2 1 38 83 62 0.559 1.91 6.02 Term + 152799 152992 194 1 2 13 52 197 0.876 6.18 6.03 PlyA + 154271 154276 6 1.05 7.02 PlyA - 155606 155601 6 1.05 7.01 Sngl - 156267 156163 105 0 0 56 48 243 0.373 8.29 7.00 Prom - 157599 157560 40 -3.36 8.00 Prom + 158503 158542 40 -3.36 8.01 Sngl + 168855 169202 348 2 0 92 37 132 0.436 4.55 8.02 PlyA + 169647 169652 6 1.05 9.00 Prom + 173967 174006 40 0.24 9.01 Init + 174789 174932 144 2 0 95 72 57 0.704 4.92 9.02 Intr + 177702 177824 123 2 0 100 69 56 0.662 5.68 9.03 Term + 188162 188281 120 1 0 90 42 56 0.103 -0.33 9.04 PlyA + 190378 190383 6 1.05 10.03 PlyA - 191899 191894 6 1.05 10.02 Term - 196337 196245 93 2 0 100 50 63 0.570 1.53 10.01 Intr - 198889 198726 164 2 2 42 46 93 0.185 0.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 44361 44261 101 1 2 87 49 109 0.828 5.19 S.002 Init + 96894 96901 8 2 2 99 97 2 0.831 2.56 S.003 Intr - 144571 144391 181 1 1 89 95 149 0.869 15.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_1|249_aa MALLLGLPQTLRRLEGKQTSSYSVCPQHEAVHTEPLDELYKVLAETLMAKGSTQGHWSYL LPSGGSVTLSESTAIISHGTTGLVTWDTALYLAEWAENPAAFTHRTVLELGSGASLTGLA ICKMCRPRAYIFSDCHSRVLEQLRGNVLNGLSLEADITSNLDSPRVTLAQLDWDVATVHQ LSAFQPDVVIAADVLYCPEAIVSLVRVLWRLAACREHQRAPEVYVAFTVRNPETCQLFTT ELEQYEEFV >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_1|750_bp atggccttgctcctgggcctgcctcaaaccctccgcagattagaaggaaagcaaacaagc tcttactctgtctgcccccagcacgaggctgtccacacggagcctttggacgagctgtac aaggtgctggcagagaccctgatggccaaggggtccacccagggccactggagctatttg ctgccctcgggaggctccgtcacactctccgagagtacggccatcatctcccacggcacc acaggcctggtcacatgggacactgccctctaccttgcagaatgggccgagaacccagca gccttcactcacaggactgtcttagagcttggcagtggtgccagcctcacaggcctggcc atctgcaagatgtgccgcccccgggcatacatcttcagcgactgtcacagccgggtcctc gagcagctccgagggaatgtcctcaatggcctctcattagaggcagacatcacttccaac ttagacagccccagggtgacattggcccagctggactgggacgtcgcgacggtccatcag ctctctgccttccagccagatgttgtcattgcagcagacgtgctgtattgcccagaagcc attgtgtcgctggtcagggtcctgtggaggctggctgcctgccgggagcaccagcgggct cctgaggtctatgtggccttcactgtccgcaacccagagacgtgccagctgttcaccacc gagctagaacagtatgaggagtttgtgtga >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_2|353_aa MSKNAAILMPEVEEEGQLEQPHGFSNMEAIKNHFESIWNGGWGQKLVWKRWKTVWSMDDN FEVNKQHPVWTAATLAKDGKNQSAQAVTVYDKPASFFKEAPLDLQHWLFMKLGSTHSLFR ARLFFCSFISEPEDPATERSAFMEQDAGSRLVTRLHEQPALLVSSTSWTGKRPLREYYSC LIHQKHFQHIQVCTPWLEAEDYPLLLAWSVDLGVCLHMSSSGLDLPMKVVDMFGCCLPVC AMNFKCLHELVKHEENGLVLEDSEELAAQLQMLFSNFPDPEGKLNQFRKNLRESQKLRWD ESWVQTVLPLVMDTELLGQRLKPQGPCCPSRSFFSESQGKSFRVAPPSGQKLI >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_2|1062_bp atgtccaagaacgcagctatcctgatgccagaagttgaggaagaaggacagttagaacaa ccccatggtttcagcaacatggaggccatcaagaaccattttgaatccatatggaatggg gggtggggacagaagcttgtttggaaaaggtggaaaactgtttggtctatggatgacaat ttcgaggtgaacaaacaacatcctgtttggacagctgctactctggctaaagatggtaaa aatcagtcagctcaggctgtgaccgtctacgacaaaccggcatctttctttaaagaggca cctctggacctgcagcactggctcttcatgaagctgggcagcacacactctctgttcagg gcccggctttttttctgctctttcatctcagaacctgaggatccagccacggagcggtcg gccttcatggagcaggatgctgggagcaggctggtgacacgtctccatgagcagccagcc ctgctggtcagcagcacgagctggacaggcaaacggcctctgagggagtattacagctgc ctcatccaccagaagcatttccagcatatccaggtctgcaccccctggctggaggccgag gactaccccctgcttctagcatggtcggtggacctgggtgtctgtctgcacatgtcctcc agtggcctggacctgcccatgaaggtggtggacatgttcgggtgctgtttgcctgtgtgt gccatgaacttcaagtgtttacatgagctggtgaaacatgaagaaaacggcctggtcctt gaggactcagaggaactggcagctcagctgcagatgcttttctcaaactttcctgaccct gagggcaagctaaaccagttccggaagaacctgcgggagtcgcagaagctccgatgggat gagagctgggtgcagactgtgctccctttggttatggacacagaactcctgggccagagg ctaaaaccccagggcccctgctgtccttcccgcagcttcttctcggagtctcagggcaaa tcctttcgagtagcgcctcccagtggccagaagctgatatga >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_3|206_aa MSSGTWREPINSFWWKLMVPSIFVSASDELKNKTVCEDKELKLHCHESKFLNIYSVTYGR RTQERDICSSKPERLPPFRQKVSNLYRPTESNGCLYIPPPRTTEDPERQPVLTGLFLSMC LVTVLGKLLIMLAFSPDSHLHTHMYFFLSNLSLPDISFTSTIVPKMIVDIQSHSRVISYA GRLTQMSLFAIFGGMEDRNAPECDGL >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_3|621_bp atgagcagtgggacttggagagagcccatcaattccttctggtggaagctgatggtgcct tcaatttttgtttctgcttcagatgaattaaaaaacaaaaccgtgtgtgaagacaaggag ctgaaactgcactgccatgaatccaagttcctcaacatctactctgtgacatatggcagg aggacccaggaaagggacatctgctcctccaagccagagcggctccccccttttcggcaa aaggtgtccaatctctacagacccacagaatctaatggatgtctctatattcctcctcct agaaccacagaggatccagaacggcagccggtcctcactgggctgttcctgtccatgtgc ctggtcacagtgctggggaagctgctcatcatgttggccttcagccctgactcccacctc cacacccacatgtacttcttcctctccaacctgtccttgcctgacatcagtttcacctcc accattgtccccaagatgattgtggacatccagtctcacagcagagtgatctcctatgca ggccgcctgactcagatgtctctctttgccatttttggaggcatggaagacagaaatgct cctgagtgtgatggcctatga >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_4|856_aa MTVRGRDIDFLVNRGAEHSLVTAPVAPLSKKMIDIIGAMGVSAKQAFCLPRTRTVGGHKV IHQFWYMPDCPLPFMGRDLLSKLRATISLTEHGSLLPKLPRTGVIMTLMVPERRNGDFSE LSQAKREDQLWLSGGQEYGRKTTLRDWPVKTGAQPVRQKQDPVPREALQGIQVRLKHLRT FGIIVPCQSAWNTPLLPVPKPRTKDYRPVQDLRLLHQATLTFPPTVPNPSTLLGLLPAED SWFTCFDLKDAFFPIRLAPERQKLFAFQWEDPESGISETSVLGDWSKPRTFDIKENPDFA RRNKMSSVRHSRCRCRPARYRLPRTALAVSEHQPLPVPIAETGGAHHGHGARGASGGKRS PRALRSPAQLMTAPAVQPEMLPPGSNGTAYPGQFALYQQLAQGNAVGGSAGAPPLGPSQV VTACLLTLLIIWTLLGNVLVCAAIVRSRHLRANMTNVFIVSLAVSDLFVALLVMPWKAVA EVAGYWPFGAFCDVWVAFDIMCSTASILNLCVISVDRYWAISRPFRYKRKMTQRMALVMV GLAWTLSILISFIPVQLNWHRDQAASWGGLDLPNNLANWTPWEEDFWEPDVNAENCDSSL NRTYAISSSLISFYIPVAIMIVTYTRIYRIAQVQIRRISSLERAAEHAQSCRSSAACAPD TSLRASIKKETKVLKTLSVIMGVFVCCWLPFFILNCMVPFCSGHPEGPPAGFPCVSETTF DVFVWFGWANSSLNPVIYAFNADFQKVFAQLLGCSHFCSRTPVETVNISNELISYNQDIV FHKEIAAAYIHMMPNAVTPGNREVDNDEEEGPFDRMFQIYQTSPDGDPVAESVWELDCEG EISLDKITPFTPNGFH >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_4|2571_bp atgacagtccggggtagagacattgattttcttgtaaatagaggtgctgaacattcgcta gtaactgccccggttgcccccttatccaaaaagatgattgacatcatcggagccatgggg gtttcagcaaagcaagctttctgcttgcctcggactcgtactgtaggaggacataaagtc attcatcagttttggtacatgcctgactgtcccttgccctttatgggaagggacttgctc agcaagctgagagccactatctctttgacagagcacggctctttgctgccaaagttaccc agaacgggagtcattatgacccttatggtccccgagaggaggaatggagacttttctgaa ctgagccaggccaagagagaagaccagctctggctaagcggtggccaagagtacgggcgg aagacaaccctccgggattggccagttaagactggggcccagccggttaggcaaaaacag gacccggtccccagagaagcccttcaaggtatccaggtccgtctcaagcacctaagaact tttggaattattgttccttgtcagtctgcgtggaacactcccctcctgcctgttcccaag ccacggaccaaggactaccggccggtacaggatttgcgcttgcttcatcaagctacactg actttccctccaacagtacctaacccgtccacattgttggggttgctgccagctgaggac agctggttcacctgctttgacctgaaagatgctttctttcctatcagattagcccccgag aggcagaagctgtttgcctttcagtgggaagatccggagtcaggcatctctgaaacatca gttttgggtgattggtctaagccaaggacctttgatatcaaggaaaatcctgattttgcc cggagaaacaaaatgtccagtgtcagacacagccgctgccgctgccgtccggcgcgctac agactcccgagaacagccctggctgtcagcgagcaccagccgcttcctgtccccatcgcg gagactggaggggcgcaccacggccatggagccagaggcgcttcaggaggcaagagaagt ccccgcgcgctccgcagcccggcgcagctcatgaccgcccctgcagtccagcccgaaatg ctgccgccaggcagcaacggcaccgcgtacccggggcagttcgctctataccagcagctg gcgcaggggaacgccgtggggggctcggcgggggcaccgccactggggccctcacaggtg gtcaccgcctgcctgctgaccctactcatcatctggaccctgctgggcaacgtgctggtg tgcgcagccatcgtgcggagccgccacctgcgcgccaacatgaccaacgtcttcatcgtg tctctggccgtgtcagaccttttcgtggcgctgctggtcatgccctggaaggcagtcgcc gaggtggccggttactggccctttggagcgttctgcgacgtctgggtggccttcgacatc atgtgctccactgcctccatcctgaacctgtgcgtcatcagcgtggaccgctactgggcc atctccaggcccttccgctacaagcgcaagatgactcagcgcatggccttggtcatggtc ggcctggcatggaccttgtccatcctcatctccttcattccggtccagctcaactggcac agggaccaggcggcctcttggggcgggctggacctgccaaacaacctggccaactggacg ccctgggaggaggacttttgggagcccgacgtgaatgcagagaactgtgactccagcctg aatcgaacctacgccatctcttcctcgctcatcagcttctacatccccgttgccatcatg atcgtgacctacacgcgcatctaccgcatcgcccaggtgcagatccgcaggatttcctcc ctggagagggccgcagagcacgcgcagagctgccggagcagcgcagcctgcgcgcccgac accagcctgcgcgcttccatcaagaaggagaccaaggttctcaagaccctgtcggtgatc atgggggtcttcgtgtgttgctggctgcccttcttcatccttaactgcatggtccctttc tgcagtggacaccccgaaggccctccggccggcttcccctgcgtcagtgagaccaccttc gacgtcttcgtctggttcggctgggctaactcctcactcaaccccgtcatctatgccttc aacgccgactttcagaaggtgtttgcccagctgctggggtgcagccacttctgctcccgc acgccggtggagacggtgaacatcagcaatgagctcatctcctacaaccaagacatcgtc ttccacaaggaaatcgcagctgcctacatccacatgatgcccaacgccgttacccccggc aaccgggaggtggacaacgacgaggaggagggtcctttcgatcgcatgttccagatctat cagacgtccccagatggtgaccctgttgctgagtctgtctgggagctggactgcgagggg gagatttctttagacaaaataacacctttcaccccgaatggattccattaa >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_5|155_aa MPRGHFHTLCPQKTSGGTKEQHEQRAMVMGIDWVLATCKCSVLSTLETLTHLLSQQIQVV GPLCPYEDIGFTQSSTLMTQTQDPYFNHISQAPFALDGCWVVDDPSHAERVSFARLSSPS DTLYGANHYVLLFLPGSVKQELLPFLLTPPPSKED >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_5|468_bp atgcccagagggcattttcacactctctgtccacaaaagacctcagggggaaccaaagag cagcatgagcagagggccatggtcatgggcattgactgggtgcttgctacatgcaaatgc tctgtgttaagcaccttggaaacactaactcatttactgtcccaacaaatccaggttgta ggacctttgtgtccttatgaggacattgggtttacccagtcatccacgttaatgacccaa actcaagatccttactttaatcacatcagccaagccccatttgccctagatggatgctgg gtcgtggatgaccctagccatgctgagcgggtgtcctttgcaagactttcatctccatct gacacgctgtatggggccaaccattatgtcctcttatttcttcctggatctgttaaacag gaacttctgccctttttactcactcctcccccaagcaaggaggactaa >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_6|87_aa MAQATLMLTTANAANRIPEPECGVLQNQREPHGQKTPCQSYLNEWEEEPNSKVGEPVDGA CNDEGSRPLRLLEELTSQDERDATCSV >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_6|264_bp atggcccaagcaacactgatgctgaccacagcaaatgctgccaacaggatcccagagcca gagtgtggagtgctgcagaatcaaagggaaccccatgggcaaaagactccttgtcagtca tacctgaatgaatgggaagaggagcccaacagcaaagttggagagccagttgacggtgcc tgcaatgatgaaggcagccggccgctgagattgctggaagaactcaccagtcaagatgaa cgggatgccacctgcagtgtgtga >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_7|34_aa MPATVTIIIFTSSYRSTIVIIITIIIFFITITSC >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_7|105_bp atgcctgccacagttaccatcatcatcttcaccagcagctacagaagcaccatcgtcatc atcattaccatcatcatcttcttcatcaccatcacttcttgctaa >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_8|115_aa MNAQMDTGNPAPASAPPPLVQGLAQVLATPPPPVSHPCCAAAIHHWCSYMQEHHHPTPAG APPQAMHVHPTALLQLLAHLSEHGSHCRSALAGTPIEVLWSLGWEHLGPFSVAGS >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_8|348_bp atgaatgcacaaatggacactggcaaccctgcccctgccagtgccccacccccactggtg caagggctagcacaggtgctggcaaccccacccccaccagtgtcccacccctgctgtgct gctgccattcatcactggtgcagctatatgcaggaacaccaccacccaactcctgctggt gccccaccccaggcaatgcacgtgcaccccactgcactgttgcagctgctggcacacttg agcgagcatggatcccactgccgaagtgctttggctggcacacccatcgaagtgttgtgg tcattgggctgggaacacctcggcccattcagcgtagcaggttcctaa >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_9|128_aa MDGAGGHYPKSTNIETENQIPHVLNYKWELNIEHTWTQGNNDTVAYLRVSSSNLHSLQLL LLSHILMLISKPLLEVRERLLSLQNRQPQNCGRTHCCHLWNFVTAATGNGYSPTEDCVLT ISAAARNN >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_9|387_bp atggatggagctggaggccattatcctaagtcaactaatatagaaacagaaaaccaaata ccacatgttctcaattataagtgggagctaaacattgagcacacatggacacaagggaac aatgacactgtggcctacttgagggtgtcttcctccaacttgcacagcctgcagctccta ttgctgtcacatatattaatgctgatatcaaagcccctgctggaagtcagggagcgcctg ctctctttgcagaaccggcagccccagaactgtgggagaacacattgctgtcatttgtgg aactttgtcacagcagccacaggaaatggatacagcccaacagaggactgtgttctcacc atctcagcagctgcaaggaataattaa >gi568815594f:9682030_9883460|GENSCAN_predicted_peptide_10|85_aa XQTCSEKKLEAETPARGLEQPAQGSYLGQCHPCKGVGIATMISAGHEGKKAVCFMRLEDG AEGSNPPSMASAALKPSNDPPESPH >gi568815594f:9682030_9883460|GENSCAN_predicted_CDS_10|258_bp nggcagacctgcagtgagaagaaactggaggcagagacaccagccaggggcctggagcag ccagcccagggcagttatctgggacagtgccacccctgcaaaggtgtgggcattgccact atgataagtgcaggtcatgagggcaagaaagctgtatgtttcatgaggctggaagatgga gctgaaggttccaaccctccaagcatggccagcgctgccttgaaaccatctaatgaccca cctgagtcaccacattag