GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:09:52 Sequence gi568815576f:17481035_17690233 : 209199 bp : 45.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 16369 16552 184 1 1 37 91 302 0.957 25.19 1.02 Intr + 18376 18515 140 0 2 93 91 143 0.994 14.36 1.03 Intr + 22048 22097 50 1 2 102 50 26 0.989 -1.48 1.04 Intr + 23813 23982 170 0 2 104 91 152 0.863 16.77 1.05 Intr + 30779 30862 84 1 0 57 105 41 0.040 2.72 1.06 Intr + 43084 43237 154 0 1 83 95 223 0.957 22.15 1.07 Intr + 56000 56198 199 0 1 97 76 38 0.517 1.91 1.08 Intr + 57486 57523 38 0 2 78 96 9 0.596 -1.39 1.09 Intr + 57606 57697 92 1 2 82 70 77 0.798 5.01 1.10 Intr + 57959 58085 127 1 1 62 91 134 0.989 11.35 1.11 Intr + 59378 59682 305 0 2 60 64 226 0.886 13.71 1.12 Intr + 60805 60933 129 0 0 58 115 101 0.994 10.69 1.13 Intr + 61123 61969 847 0 1 92 86 236 0.577 14.75 1.14 Intr + 67114 68530 1417 2 1 107 90 541 0.933 44.67 1.15 Intr + 70997 71108 112 2 1 84 110 87 0.977 10.88 1.16 Intr + 88890 88952 63 2 0 86 111 17 0.082 2.71 1.17 Intr + 98711 98753 43 1 1 80 34 64 0.119 -1.99 1.18 Intr + 98843 98930 88 0 1 42 83 94 0.674 3.33 1.19 Intr + 100003 100125 123 1 0 106 51 194 0.999 17.20 1.20 Intr + 100324 100379 56 1 2 98 64 86 0.684 5.82 1.21 Intr + 101529 101629 101 1 2 118 75 233 0.962 24.83 1.22 Intr + 102296 102500 205 1 1 21 68 181 0.830 8.17 1.23 Intr + 106102 106267 166 2 1 121 99 -19 0.901 1.52 1.24 Intr + 106891 107045 155 1 2 122 91 106 0.605 14.02 1.25 Intr + 108556 108631 76 2 1 34 94 117 0.998 5.47 1.26 Term + 109061 109202 142 2 1 150 42 50 0.922 4.00 1.27 PlyA + 109831 109836 6 1.05 2.10 PlyA - 110513 110508 6 -0.45 2.09 Term - 111702 111640 63 0 0 105 46 89 0.982 4.29 2.08 Intr - 113582 113495 88 1 1 126 109 46 0.992 10.47 2.07 Intr - 117254 117160 95 2 2 94 75 111 0.990 9.16 2.06 Intr - 119100 118993 108 0 0 91 98 47 0.607 6.48 2.05 Intr - 120147 120058 90 0 0 97 80 58 0.983 6.09 2.04 Intr - 131844 131778 67 2 1 73 79 -23 0.174 -5.79 2.03 Intr - 132286 132177 110 1 2 95 101 109 0.758 11.98 2.02 Intr - 138492 138427 66 0 0 83 100 103 0.873 10.10 2.01 Init - 147601 147569 33 1 0 98 94 53 0.658 6.77 2.00 Prom - 151045 151006 40 -5.96 3.03 PlyA - 151168 151163 6 1.05 3.02 Term - 157830 157587 244 2 1 84 49 167 0.755 7.77 3.01 Init - 177853 177807 47 1 2 86 70 27 0.067 -1.14 3.00 Prom - 178799 178760 40 -4.56 4.03 PlyA - 178811 178806 6 1.05 4.02 Term - 184486 184070 417 1 0 60 48 244 0.861 12.88 4.01 Init - 187180 187178 3 1 0 113 81 0 0.614 1.80 4.00 Prom - 190648 190609 40 -6.16 5.00 Prom + 197693 197732 40 -4.36 5.01 Init + 199621 199636 16 0 1 61 107 -1 0.671 -0.23 5.02 Intr + 202180 202287 108 2 0 122 66 72 0.913 8.66 5.03 Intr + 207952 208108 157 2 1 66 81 96 0.144 5.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 77499 77249 251 1 2 92 54 120 0.922 4.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:17481035_17690233|GENSCAN_predicted_peptide_1|1755_aa XPQTFHSYLEDIINYRWELEEGKPNPLREASFQDLPLRTRVEILHRLCDYRLDADDVFDL LKGLDADSLRVEPLGEDNSGALYWYFYGTRMYKEDPVQGKSNGELSLSSEKQEENSLASE PQTRHGSQGPGQGTWWLLCQTEEEWRQVTESFRERTSLRERQLYKLLSEDFLPEICNMIA QKGKRPQRTKAELHPRWMSDHLSIKPVKQEETPVLTRIEKQKRKEEEEERQILLAVQKKE QEQMLKEERKRELEEKVKAVEVMKEHAMSVSGVCLAHSAFVFIVDRAKRRKLREERAWLL AQGKELPPELSHLDPNSPMREEKKTKDLFELDDDFTAMYKVLDVVKAHKDSWPFLEPVDE SYAPNYYQIIKAPMDISSMEKKLNGGLYCTKEEFVNDMKTMFRNCRKYNGESSEYTKMSD NLERCFHRAMMKHFPGEDGDTDEEFWIREDEKREKRRSRAGRSGGSHVWTRSRDPEGSSR KQQPMENGGKSLPPTRRAPSSGDDQSSSSTQPPRERPAVPGTFGPLRGSDPATLYGSSGV PEPHPGEPVQQRQPFTMQPPVGINSLRGPRLGTPEEKQMCGGLTHLSNMGPHPGSLQLGQ ISGPSQDGSMYAPAQFQPGFIPPRHGGAPARPPDFPESSEIPPSHMYRSYKYLNRVHSAV WNGNHGATNQGPLGPDEKPHLGPGPSHQPRTLGHVMDSRVMRPPVPPNQWTEQSGFLPHG VPSSGYMRPPCKSAGHRLQPPPVPAPSSLFGAPAQALRGVQGGDSMMDSPEMIAMQQLSS RVCPPGVPYHPHQPAHPRLPGPFPQVAHPMSVTVSAPKPALGNPGRAPENSEAQEPENDQ AEPLPGLEEKPPGVGTSEGVYLTQLPHPTPPLQTDCTRQSSPQERETVGPELKSSSSESA DNCKAMKGKNPWPSDSSYPGPAAQGCVRDLSTVADRGALSENGVIGEASPCGSEGKGLGS SGSEKLLCPRGRTLQETMPCTGQNAATPPSTDPGLTGGTVSQFPPLYMPGLEYPNSAAHY HISPGLQGVGPVMGGKSPASHPQHFPPRGFQSNHPHSGGFPRYRPPQGMRYSYHPPPQPS YHHYQRTPYYACPQSFSDWQRPLHPQGSPSGPPASQPPPPRSLFSDKNAMASLQGCETLN AALTSPTRMDAVAAKVPNDGQNPGPEEEKLDESMERPESPKEFLDLDNHNAATKRQSSLS ASEYLYGTPPPLSSGMGFGSSAFPPHSVMLQTGPPYTPQRPASHFQPRAYSSPVAALPPH HPGATQPNGLSQEGPIYRCQEEGLGHFQAVMMEQIGTRSGIRGPFQEMYRPSGMQMHPVQ SQASFPKTPTAATSQEEVPPHKPPTLPLDQLSSDLNYILGSRKGRGSYRKQGRKPQPKEV VTCREDSPNSGYPKEPAALCPGIPSPCRMTHQDLSITAKLINGGVAGLVGVTCVFPIDLA KTRLQNQHGKAMYKGMIDCLMKTARAEGFFGMYRGAAVNLTLVTPEKAIKLAANDFFRRL LMEDGYGQGKSPSHKRAAAPGQACGLRLKEPDACSHRMQRNLKMEMLAGCGAGMCQVVVT CPMEMLKIQLQDAGRLAVHHQGSASAPSTSRSYTTGSASTHRRPSATLIAWELLRTQGLA GLYRGLGATLLRDIPFSIIYFPLFANLNNLGFNELAGKASFAHSFVSGCVAGSIAAVAVT PLDVLKTRIQTLKKGLGEDMYSGITDCARKLWIQEGPSAFMKGAGCRALVIAPLFGIAQG VYFIGIGERILKCFD >gi568815576f:17481035_17690233|GENSCAN_predicted_CDS_1|5268_bp nngcctcagacattccacagctacctagaggacatcatcaactaccgctgggagctcgaa gaagggaagcccaaccctctgagggaagccagtttccaggacctgcctcttcgcacacgg gtggagatcctgcaccgactctgtgattaccggctggatgcagacgatgtcttcgatctt ctaaagggcctggatgcagacagtctccgtgtggagccattgggtgaagacaattctggg gcactatattggtatttctatggaacacgaatgtacaaagaggacccggtgcaaggaaaa tccaatggagaactctctttgagcagtgaaaagcaggaagaaaattccttggcatccgag ccacagacaagacatgggtcccaagggccaggccaaggtacttggtggctcctgtgccag acagaagaggaatggagacaggtcaccgagagttttcgcgagaggacctcccttcgagaa cggcagctctacaagctcctcagtgaggacttcctgcctgagatctgcaacatgatcgcc cagaagggaaaacgtccacagcgcacaaaggcagagttgcatcctaggtggatgtctgac cacctgtccatcaaacccgtcaagcaagaggagactcctgtgctgaccagaatagaaaaa caaaagcgcaaagaggaggaagaagagcgtcagattcttctagcagtgcagaagaaggag caggagcagatgctaaaggaagagaggaaacgcgagttggaggagaaggtcaaggcagtg gaagtgatgaaggaacacgccatgtcagtgtctggagtgtgcttagctcactcagccttt gtgttcattgtagatcgagcgaagaggagaaagctcagggaagaaagggcatggctgctg gctcaaggaaaggagctccctccagaactttcccatctggaccccaattcccccatgaga gaggaaaaaaagactaaagacctctttgagttggatgatgatttcactgctatgtataaa gttctagacgtggtaaaggctcacaaggattcctggcccttcttggaacctgtggatgaa tcttatgcccctaactattatcagattattaaggcccccatggatatttccagcatggag aagaaactgaatggaggtttatactgtaccaaggaggaatttgtaaatgacatgaagacc atgttcaggaattgtcgaaagtataatggggaaagtagtgagtataccaagatgtctgat aatttagagaggtgtttccatcgggcaatgatgaaacattttcctggagaagatggagac acagatgaagaattttggattcgagaggatgaaaagcgggagaaaagacggagtcgggct gggcgaagtggtgggagccatgtttggacccgctccagggacccagaagggtccagcagg aaacagcagcccatggagaatggaggaaagtcgttgccccccacacgccgagcgccctct tctggggacgatcagagcagcagctccacacagcccccgcgggagaggccagcagtacca ggaacatttggccctctgcgaggatcagatcctgccaccttgtatggctcctctggagtc ccggagccacaccccggggagcctgtgcagcagcgtcagcctttcaccatgcagcctcca gttggaattaacagcctccgaggacccaggctaggcacaccagaggagaagcaaatgtgc ggggggctgacacacctttctaacatgggcccacaccctggatccttgcagcttgggcag ataagtggcccaagtcaggatggaagcatgtatgctccagctcagttccagccaggattc attcctccccggcatgggggggctccagcccggccaccagactttcctgaaagctcagaa attcctcccagccatatgtatcgatcgtacaagtacctgaatcgagtacactctgccgtc tggaatgggaaccatggtgctacgaaccaaggacccttgggcccagatgagaagccccac ctggggccaggaccctctcaccagcctcgcactctcggtcacgtgatggattcccgagtc atgagaccacctgtcccccccaaccagtggactgaacaatcaggcttcctacctcatgga gttccttcctcagggtacatgcgaccgccctgcaagtctgccggacatcggttacagcca cctccagtgccagcacccagttctttgtttggagcacctgcccaggctcttcggggggtg cagggaggggactccatgatggacagcccagagatgattgcgatgcagcagctctcctcc cgcgtctgccccccaggtgtgccttaccacccccaccagcctgcacacccccgtttacct ggcccttttccgcaggtagctcacccaatgtcagtcactgtgtcagcccccaagcctgcc ctgggcaaccctgggagggcaccggagaacagtgaagcacaagagcctgagaatgaccaa gcagagccgttgcctggccttgaagagaaaccaccaggtgttggtacttcagagggggtc tacctcacacaactacctcaccccacacctcccctgcagactgactgcaccaggcagagc tcaccacaagaaagggaaacagtgggcccggagctcaaaagcagctcctccgaatctgcg gacaactgtaaagcaatgaagggcaagaatccctggccctcggatagcagctaccccggc ccagccgcccaagggtgcgtgagagacctctccacggtggcagacaggggcgctctatcc gagaacggagtcattggggaagcatctccttgtggatcggaggggaagggccttggtagc agtggttccgaaaagctgctctgccccagaggcagaacgttgcaggaaaccatgccatgc acgggacagaacgcagcgacaccgcccagcacagaccccggtttgacgggaggcactgtg agccagtttcccccgctgtatatgcctggcctagagtacccgaattcagctgcccattac cacatcagtccaggcctgcagggtgtgggccctgtgatgggagggaagtccccagcatcc catccccagcattttcccccaaggggctttcagtctaaccacccacattctggaggcttt ccccggtatcgccccccacaaggaatgaggtattcctaccacccaccgccacagccttcc taccaccactatcagcgaactccttactatgcctgtccacagagcttttctgactggcag agacctctccatccccagggaagcccaagcggacccccagccagtcagcctcccccacca aggtccctcttctcagataagaatgccatggccagtctgcaaggctgtgagacactgaat gctgccttaacttctccaacccgtatggatgcagtggctgctaaagtcccaaatgacggg cagaatcctggtccagaggaagagaagctggatgaatctatggagaggccagagagtccc aaagaatttttagacctggacaaccataacgcagctaccaagcggcagagctcgttgtca gccagcgagtatctctatggaactcctccgcctctgagttcaggaatgggatttggttca tctgcatttccaccccacagtgtgatgctgcagacggggcctccctatacccctcagcgg ccggccagtcactttcagcccagggcttactcttcccctgtggctgccctcccacctcac cacccaggggccacccagcccaacggcctctctcaggagggtcccatctatcgctgccag gaagaaggcctgggtcactttcaagctgtgatgatggaacaaattggcactagaagtgga ataagaggacctttccaggaaatgtacagaccatcaggaatgcagatgcacccggtccag tcgcaggcctcgttcccaaagacccccacagcagcaacatcacaggaggaggtgccgcct cataagcctccaacacttcccctggatcagctcagcagcgatctgaactatatcctgggt tccagaaaaggcagaggttcttaccgaaagcaggggaggaagccgcagcccaaggaggtc gtcacttgccgggaagacagccccaacagcggctaccccaaggagccagcagccttgtgt cctgggatccccagcccctgcagaatgacccaccaggatctgagcatcacagccaaactc atcaatggaggtgtagcagggctcgtgggggtgacctgcgtgttccccatcgacttggcc aagactcgcctgcagaaccagcatgggaaagccatgtacaaaggaatgatcgactgcctg atgaagacggctcgggcggagggcttcttcggcatgtaccgaggggctgcagtgaacctc actctggtcactccagagaaggccatcaagctggcggccaacgactttttccggcggctg ctcatggaagatgggtatggccagggaaagtccccctctcacaagagggcagctgcacca gggcaggcctgtggcctgcggctgaaggagcctgacgcctgttcccataggatgcagcgg aacctgaagatggagatgcttgccgggtgtggggctgggatgtgccaggtcgtggtgacc tgtcccatggaaatgctcaagattcagctgcaggatgctggacgcctggccgtccatcat cagggctcggcctcagcaccctccacctccaggtcctacacaactggttcggcttccacc cacaggcgcccctctgccaccctcattgcctgggagctgctccgcactcagggcctggct gggctctacaggggcctgggtgccactctcctcagagacattcctttctccatcatctac ttcccactgtttgccaaccttaacaacctggggttcaacgagctcgccggtaaggcgtcc tttgcacattccttcgtgtcaggctgtgtggcaggttccatagctgcggtcgcagtgacg cctctagatgttctgaaaactcgaatccaaaccctcaagaaaggcctgggcgaggacatg tacagtgggatcaccgactgtgccaggaaactctggattcaggagggaccatctgccttc atgaaaggcgctggctgccgggcactggtcatagcacctctctttgggattgctcaaggg gtctattttattgggattggagagcgcatcttaaagtgttttgactag >gi568815576f:17481035_17690233|GENSCAN_predicted_peptide_2|239_aa MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK EKQIEQQKKIQMSNLMNQARLKVLRARDDLITDLLNEAKQRLSKVVKDTTRYQVLLDGLV LQFAFSTDCIYYFFQGLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATKNDVDVQI DQESYLPEDIAGGVEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGANANRKFLD >gi568815576f:17481035_17690233|GENSCAN_predicted_CDS_2|720_bp atggctctcagcgatgctgacgtgcaaaagcagataaagcatatgatggctttcattgaa caagaagccaatgagaaagcagaagaaatagatgcaaaggcagaagaagagttcaacata gagaaaggtcggcttgtgcaaacccaaagactaaagattatggaatattatgagaagaaa gagaaacagattgagcagcagaagaaaattcagatgtccaatttgatgaatcaagcgaga ctcaaagtcctcagagcaagagatgaccttatcacagacctactaaatgaagcaaaacag agactcagcaaggtggtaaaagatacaaccaggtaccaagtgctgctggatggactggtt ctccagtttgcattttctactgactgtatctactatttcttccagggtttgtaccagttg ctggagccccgaatgattgttcgttgcaggaaacaagatttccctctggtaaaggctgca gtgcagaaggcaattcctatgtacaaaattgccaccaaaaacgatgttgatgtccaaatt gaccaggagtcctacctgcctgaagacatagctggtggagttgagatctataatggagat cgtaaaataaaggtttccaacaccctggaaagccggctggatctcatagcccagcagatg atgccagaagtccggggagccttgtttggtgcaaatgccaacaggaagtttttggactaa >gi568815576f:17481035_17690233|GENSCAN_predicted_peptide_3|96_aa MGFHHLGQAGLELLTLRDERGGGGSCFRQGPPAAVSAPTRGPNLPPPPCWRVRSPRRAFV TAGVGAGPELPDIRNGIRSGRTDQRRPRRSLRPDVT >gi568815576f:17481035_17690233|GENSCAN_predicted_CDS_3|291_bp atggggtttcaccatcttggccaggctggtcttgaacttctgaccttaagagatgaaaga ggcggcggcggcagttgcttccgacagggtcccccagcggcggtgagtgctccgacccgc ggccctaatctaccgccgccgccatgttggagggtgaggtcaccccggcgtgccttcgtc acggccggcgtcggggcgggacctgaactacctgacatccggaacggtatccggagcgga aggaccgaccaacggaggccccggaggtctctccgtccagatgtgacctga >gi568815576f:17481035_17690233|GENSCAN_predicted_peptide_4|139_aa MKIGTGSGALLKHTRKCEATLEMGNKQAEVGTVWKAQETTGKMWESLELPRDLLNGFDQN ADSDMGDKVQVEVVSDGDKELIGNWNKGDSCYVLAKRLVAFCPCPRDLWNFKPETDDLGY LAEEISEQQSFKRKQSIKV >gi568815576f:17481035_17690233|GENSCAN_predicted_CDS_4|420_bp atgaaaattggtactgggagtggggcactgcttaaacatacccgaaaatgtgaagcaact ttggaaatgggtaacaaacaggcagaagttggaacagtgtggaaggctcaggaaacaaca ggaaaaatgtgggagagtctggaacttcctagagacttattgaatggttttgaccaaaat gctgacagtgatatgggcgataaagtccaagttgaggtggtctcagatggagataaggaa cttattgggaactggaacaaaggtgactcttgctatgttttagcaaagagactggtggcg ttttgcccctgccctagagatctgtggaactttaaacctgagacagatgatttagggtat ctggcagaagaaatctctgagcaacaaagcttcaagaggaagcaaagcataaaagtttga >gi568815576f:17481035_17690233|GENSCAN_predicted_peptide_5|94_aa MHDSRGVQLDIASQSLDQEILLKVKTEIEEELKSLDKEISEAFTSTGFDRHTSPVFSPAN PESSMEDCLAHLGEKVSQELKEPLHKALQMLLSH >gi568815576f:17481035_17690233|GENSCAN_predicted_CDS_5|282_bp atgcatgattcacgaggggttcaactagatatagcttcacaatctctggatcaagaaatt ttattaaaagttaaaactgaaattgaagaagagctaaaatctctggacaaagaaatttct gaagccttcaccagcacaggctttgaccgtcacacttctccagtgttcagccctgccaat ccagaaagctcaatggaagactgcttggcccatcttggagaaaaagtgtcccaggaactg aaagagcctctccataaagcattgcaaatgctcctgagccan