GENSCAN 1.0	Date run: 3-Nov-116	Time: 05:04:47

Sequence gi568815576r:17492677_17719528 : 226852 bp : 44.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4727   4910  184  2  1   37   91   302 0.962  25.19
 1.02 Intr +   6734   6873  140  1  2   93   91   143 0.994  14.36
 1.03 Intr +  10406  10455   50  2  2  102   50    26 0.989  -1.48
 1.04 Intr +  12171  12340  170  1  2  104   91   152 0.863  16.77
 1.05 Intr +  19137  19220   84  2  0   57  105    41 0.040   2.72
 1.06 Intr +  31442  31595  154  1  1   83   95   223 0.957  22.15
 1.07 Intr +  44358  44556  199  1  1   97   76    38 0.517   1.91
 1.08 Intr +  45844  45881   38  1  2   78   96     9 0.596  -1.39
 1.09 Intr +  45964  46055   92  2  2   82   70    77 0.798   5.01
 1.10 Intr +  46317  46443  127  2  1   62   91   134 0.989  11.35
 1.11 Intr +  47736  48040  305  1  2   60   64   226 0.886  13.71
 1.12 Intr +  49163  49291  129  1  0   58  115   101 0.994  10.69
 1.13 Intr +  49481  50327  847  1  1   92   86   236 0.577  14.75
 1.14 Intr +  55472  56888 1417  0  1  107   90   541 0.933  44.67
 1.15 Intr +  59355  59466  112  0  1   84  110    87 0.977  10.88
 1.16 Intr +  77248  77310   63  0  0   86  111    17 0.082   2.71
 1.17 Intr +  87069  87111   43  2  1   80   34    64 0.119  -1.99
 1.18 Intr +  87201  87288   88  1  1   42   83    94 0.674   3.33
 1.19 Intr +  88361  88483  123  2  0  106   51   194 0.999  17.20
 1.20 Intr +  88682  88737   56  2  2   98   64    86 0.684   5.82
 1.21 Intr +  89887  89987  101  2  2  118   75   233 0.962  24.83
 1.22 Intr +  90654  90858  205  2  1   21   68   181 0.830   8.17
 1.23 Intr +  94460  94625  166  0  1  121   99   -19 0.901   1.52
 1.24 Intr +  95249  95403  155  2  2  122   91   106 0.605  14.02
 1.25 Intr +  96914  96989   76  0  1   34   94   117 0.998   5.47
 1.26 Term +  97419  97560  142  0  1  150   42    50 0.922   4.00
 1.27 PlyA +  98189  98194    6                               1.05

 2.10 PlyA -  98871  98866    6                              -0.45
 2.09 Term - 100060  99998   63  1  0  105   46    89 0.982   4.29
 2.08 Intr - 101940 101853   88  2  1  126  109    46 0.992  10.47
 2.07 Intr - 105612 105518   95  0  2   94   75   111 0.990   9.16
 2.06 Intr - 107458 107351  108  1  0   91   98    47 0.607   6.48
 2.05 Intr - 108505 108416   90  1  0   97   80    58 0.983   6.09
 2.04 Intr - 120202 120136   67  0  1   73   79   -23 0.174  -5.79
 2.03 Intr - 120644 120535  110  2  2   95  101   109 0.758  11.98
 2.02 Intr - 126850 126785   66  1  0   83  100   103 0.873  10.10
 2.01 Init - 135959 135927   33  2  0   98   94    53 0.658   6.77
 2.00 Prom - 139403 139364   40                              -5.96

 3.03 PlyA - 139526 139521    6                               1.05
 3.02 Term - 146188 145945  244  0  1   84   49   167 0.755   7.77
 3.01 Init - 166211 166165   47  2  2   86   70    27 0.067  -1.14
 3.00 Prom - 167157 167118   40                              -4.56

 4.03 PlyA - 167169 167164    6                               1.05
 4.02 Term - 172844 172428  417  2  0   60   48   244 0.862  12.88
 4.01 Init - 175538 175536    3  2  0  113   81     0 0.616   1.80
 4.00 Prom - 179006 178967   40                              -6.16

 5.00 Prom + 186051 186090   40                              -4.36
 5.01 Init + 187979 187994   16  1  1   61  107    -1 0.727  -0.23
 5.02 Intr + 190538 190645  108  0  0  122   66    72 0.990   8.66
 5.03 Intr + 196310 196466  157  0  1   66   81    96 0.457   5.77
 5.04 Intr + 203465 203534   70  2  1   47   64    61 0.554  -1.22
 5.05 Intr + 209567 209710  144  1  0   46   51   182 0.508  10.78
 5.06 Term + 222995 223087   93  1  0   51   45    94 0.215  -0.77
 5.07 PlyA + 223208 223213    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 65857 65607 251 2 2 92 54 120 0.922 4.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:17492677_17719528|GENSCAN_predicted_peptide_1|1755_aa XPQTFHSYLEDIINYRWELEEGKPNPLREASFQDLPLRTRVEILHRLCDYRLDADDVFDL LKGLDADSLRVEPLGEDNSGALYWYFYGTRMYKEDPVQGKSNGELSLSSEKQEENSLASE PQTRHGSQGPGQGTWWLLCQTEEEWRQVTESFRERTSLRERQLYKLLSEDFLPEICNMIA QKGKRPQRTKAELHPRWMSDHLSIKPVKQEETPVLTRIEKQKRKEEEEERQILLAVQKKE QEQMLKEERKRELEEKVKAVEVMKEHAMSVSGVCLAHSAFVFIVDRAKRRKLREERAWLL AQGKELPPELSHLDPNSPMREEKKTKDLFELDDDFTAMYKVLDVVKAHKDSWPFLEPVDE SYAPNYYQIIKAPMDISSMEKKLNGGLYCTKEEFVNDMKTMFRNCRKYNGESSEYTKMSD NLERCFHRAMMKHFPGEDGDTDEEFWIREDEKREKRRSRAGRSGGSHVWTRSRDPEGSSR KQQPMENGGKSLPPTRRAPSSGDDQSSSSTQPPRERPAVPGTFGPLRGSDPATLYGSSGV PEPHPGEPVQQRQPFTMQPPVGINSLRGPRLGTPEEKQMCGGLTHLSNMGPHPGSLQLGQ ISGPSQDGSMYAPAQFQPGFIPPRHGGAPARPPDFPESSEIPPSHMYRSYKYLNRVHSAV WNGNHGATNQGPLGPDEKPHLGPGPSHQPRTLGHVMDSRVMRPPVPPNQWTEQSGFLPHG VPSSGYMRPPCKSAGHRLQPPPVPAPSSLFGAPAQALRGVQGGDSMMDSPEMIAMQQLSS RVCPPGVPYHPHQPAHPRLPGPFPQVAHPMSVTVSAPKPALGNPGRAPENSEAQEPENDQ AEPLPGLEEKPPGVGTSEGVYLTQLPHPTPPLQTDCTRQSSPQERETVGPELKSSSSESA DNCKAMKGKNPWPSDSSYPGPAAQGCVRDLSTVADRGALSENGVIGEASPCGSEGKGLGS SGSEKLLCPRGRTLQETMPCTGQNAATPPSTDPGLTGGTVSQFPPLYMPGLEYPNSAAHY HISPGLQGVGPVMGGKSPASHPQHFPPRGFQSNHPHSGGFPRYRPPQGMRYSYHPPPQPS YHHYQRTPYYACPQSFSDWQRPLHPQGSPSGPPASQPPPPRSLFSDKNAMASLQGCETLN AALTSPTRMDAVAAKVPNDGQNPGPEEEKLDESMERPESPKEFLDLDNHNAATKRQSSLS ASEYLYGTPPPLSSGMGFGSSAFPPHSVMLQTGPPYTPQRPASHFQPRAYSSPVAALPPH HPGATQPNGLSQEGPIYRCQEEGLGHFQAVMMEQIGTRSGIRGPFQEMYRPSGMQMHPVQ SQASFPKTPTAATSQEEVPPHKPPTLPLDQLSSDLNYILGSRKGRGSYRKQGRKPQPKEV VTCREDSPNSGYPKEPAALCPGIPSPCRMTHQDLSITAKLINGGVAGLVGVTCVFPIDLA KTRLQNQHGKAMYKGMIDCLMKTARAEGFFGMYRGAAVNLTLVTPEKAIKLAANDFFRRL LMEDGYGQGKSPSHKRAAAPGQACGLRLKEPDACSHRMQRNLKMEMLAGCGAGMCQVVVT CPMEMLKIQLQDAGRLAVHHQGSASAPSTSRSYTTGSASTHRRPSATLIAWELLRTQGLA GLYRGLGATLLRDIPFSIIYFPLFANLNNLGFNELAGKASFAHSFVSGCVAGSIAAVAVT 
PLDVLKTRIQTLKKGLGEDMYSGITDCARKLWIQEGPSAFMKGAGCRALVIAPLFGIAQG VYFIGIGERILKCFD >gi568815576r:17492677_17719528|GENSCAN_predicted_CDS_1|5268_bp nngcctcagacattccacagctacctagaggacatcatcaactaccgctgggagctcgaa gaagggaagcccaaccctctgagggaagccagtttccaggacctgcctcttcgcacacgg gtggagatcctgcaccgactctgtgattaccggctggatgcagacgatgtcttcgatctt ctaaagggcctggatgcagacagtctccgtgtggagccattgggtgaagacaattctggg gcactatattggtatttctatggaacacgaatgtacaaagaggacccggtgcaaggaaaa tccaatggagaactctctttgagcagtgaaaagcaggaagaaaattccttggcatccgag ccacagacaagacatgggtcccaagggccaggccaaggtacttggtggctcctgtgccag acagaagaggaatggagacaggtcaccgagagttttcgcgagaggacctcccttcgagaa cggcagctctacaagctcctcagtgaggacttcctgcctgagatctgcaacatgatcgcc cagaagggaaaacgtccacagcgcacaaaggcagagttgcatcctaggtggatgtctgac cacctgtccatcaaacccgtcaagcaagaggagactcctgtgctgaccagaatagaaaaa caaaagcgcaaagaggaggaagaagagcgtcagattcttctagcagtgcagaagaaggag caggagcagatgctaaaggaagagaggaaacgcgagttggaggagaaggtcaaggcagtg gaagtgatgaaggaacacgccatgtcagtgtctggagtgtgcttagctcactcagccttt gtgttcattgtagatcgagcgaagaggagaaagctcagggaagaaagggcatggctgctg gctcaaggaaaggagctccctccagaactttcccatctggaccccaattcccccatgaga gaggaaaaaaagactaaagacctctttgagttggatgatgatttcactgctatgtataaa gttctagacgtggtaaaggctcacaaggattcctggcccttcttggaacctgtggatgaa tcttatgcccctaactattatcagattattaaggcccccatggatatttccagcatggag aagaaactgaatggaggtttatactgtaccaaggaggaatttgtaaatgacatgaagacc atgttcaggaattgtcgaaagtataatggggaaagtagtgagtataccaagatgtctgat aatttagagaggtgtttccatcgggcaatgatgaaacattttcctggagaagatggagac acagatgaagaattttggattcgagaggatgaaaagcgggagaaaagacggagtcgggct gggcgaagtggtgggagccatgtttggacccgctccagggacccagaagggtccagcagg aaacagcagcccatggagaatggaggaaagtcgttgccccccacacgccgagcgccctct tctggggacgatcagagcagcagctccacacagcccccgcgggagaggccagcagtacca ggaacatttggccctctgcgaggatcagatcctgccaccttgtatggctcctctggagtc ccggagccacaccccggggagcctgtgcagcagcgtcagcctttcaccatgcagcctcca gttggaattaacagcctccgaggacccaggctaggcacaccagaggagaagcaaatgtgc ggggggctgacacacctttctaacatgggcccacaccctggatccttgcagcttgggcag 
ataagtggcccaagtcaggatggaagcatgtatgctccagctcagttccagccaggattc attcctccccggcatgggggggctccagcccggccaccagactttcctgaaagctcagaa attcctcccagccatatgtatcgatcgtacaagtacctgaatcgagtacactctgccgtc tggaatgggaaccatggtgctacgaaccaaggacccttgggcccagatgagaagccccac ctggggccaggaccctctcaccagcctcgcactctcggtcacgtgatggattcccgagtc atgagaccacctgtcccccccaaccagtggactgaacaatcaggcttcctacctcatgga gttccttcctcagggtacatgcgaccgccctgcaagtctgccggacatcggttacagcca cctccagtgccagcacccagttctttgtttggagcacctgcccaggctcttcggggggtg cagggaggggactccatgatggacagcccagagatgattgcgatgcagcagctctcctcc cgcgtctgccccccaggtgtgccttaccacccccaccagcctgcacacccccgtttacct ggcccttttccgcaggtagctcacccaatgtcagtcactgtgtcagcccccaagcctgcc ctgggcaaccctgggagggcaccggagaacagtgaagcacaagagcctgagaatgaccaa gcagagccgttgcctggccttgaagagaaaccaccaggtgttggtacttcagagggggtc tacctcacacaactacctcaccccacacctcccctgcagactgactgcaccaggcagagc tcaccacaagaaagggaaacagtgggcccggagctcaaaagcagctcctccgaatctgcg gacaactgtaaagcaatgaagggcaagaatccctggccctcggatagcagctaccccggc ccagccgcccaagggtgcgtgagagacctctccacggtggcagacaggggcgctctatcc gagaacggagtcattggggaagcatctccttgtggatcggaggggaagggccttggtagc agtggttccgaaaagctgctctgccccagaggcagaacgttgcaggaaaccatgccatgc acgggacagaacgcagcgacaccgcccagcacagaccccggtttgacgggaggcactgtg agccagtttcccccgctgtatatgcctggcctagagtacccgaattcagctgcccattac cacatcagtccaggcctgcagggtgtgggccctgtgatgggagggaagtccccagcatcc catccccagcattttcccccaaggggctttcagtctaaccacccacattctggaggcttt ccccggtatcgccccccacaaggaatgaggtattcctaccacccaccgccacagccttcc taccaccactatcagcgaactccttactatgcctgtccacagagcttttctgactggcag agacctctccatccccagggaagcccaagcggacccccagccagtcagcctcccccacca aggtccctcttctcagataagaatgccatggccagtctgcaaggctgtgagacactgaat gctgccttaacttctccaacccgtatggatgcagtggctgctaaagtcccaaatgacggg cagaatcctggtccagaggaagagaagctggatgaatctatggagaggccagagagtccc aaagaatttttagacctggacaaccataacgcagctaccaagcggcagagctcgttgtca gccagcgagtatctctatggaactcctccgcctctgagttcaggaatgggatttggttca tctgcatttccaccccacagtgtgatgctgcagacggggcctccctatacccctcagcgg 
ccggccagtcactttcagcccagggcttactcttcccctgtggctgccctcccacctcac cacccaggggccacccagcccaacggcctctctcaggagggtcccatctatcgctgccag gaagaaggcctgggtcactttcaagctgtgatgatggaacaaattggcactagaagtgga ataagaggacctttccaggaaatgtacagaccatcaggaatgcagatgcacccggtccag tcgcaggcctcgttcccaaagacccccacagcagcaacatcacaggaggaggtgccgcct cataagcctccaacacttcccctggatcagctcagcagcgatctgaactatatcctgggt tccagaaaaggcagaggttcttaccgaaagcaggggaggaagccgcagcccaaggaggtc gtcacttgccgggaagacagccccaacagcggctaccccaaggagccagcagccttgtgt cctgggatccccagcccctgcagaatgacccaccaggatctgagcatcacagccaaactc atcaatggaggtgtagcagggctcgtgggggtgacctgcgtgttccccatcgacttggcc aagactcgcctgcagaaccagcatgggaaagccatgtacaaaggaatgatcgactgcctg atgaagacggctcgggcggagggcttcttcggcatgtaccgaggggctgcagtgaacctc actctggtcactccagagaaggccatcaagctggcggccaacgactttttccggcggctg ctcatggaagatgggtatggccagggaaagtccccctctcacaagagggcagctgcacca gggcaggcctgtggcctgcggctgaaggagcctgacgcctgttcccataggatgcagcgg aacctgaagatggagatgcttgccgggtgtggggctgggatgtgccaggtcgtggtgacc tgtcccatggaaatgctcaagattcagctgcaggatgctggacgcctggccgtccatcat cagggctcggcctcagcaccctccacctccaggtcctacacaactggttcggcttccacc cacaggcgcccctctgccaccctcattgcctgggagctgctccgcactcagggcctggct gggctctacaggggcctgggtgccactctcctcagagacattcctttctccatcatctac ttcccactgtttgccaaccttaacaacctggggttcaacgagctcgccggtaaggcgtcc tttgcacattccttcgtgtcaggctgtgtggcaggttccatagctgcggtcgcagtgacg cctctagatgttctgaaaactcgaatccaaaccctcaagaaaggcctgggcgaggacatg tacagtgggatcaccgactgtgccaggaaactctggattcaggagggaccatctgccttc atgaaaggcgctggctgccgggcactggtcatagcacctctctttgggattgctcaaggg gtctattttattgggattggagagcgcatcttaaagtgttttgactag >gi568815576r:17492677_17719528|GENSCAN_predicted_peptide_2|239_aa MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK EKQIEQQKKIQMSNLMNQARLKVLRARDDLITDLLNEAKQRLSKVVKDTTRYQVLLDGLV LQFAFSTDCIYYFFQGLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATKNDVDVQI DQESYLPEDIAGGVEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGANANRKFLD >gi568815576r:17492677_17719528|GENSCAN_predicted_CDS_2|720_bp 
atggctctcagcgatgctgacgtgcaaaagcagataaagcatatgatggctttcattgaa caagaagccaatgagaaagcagaagaaatagatgcaaaggcagaagaagagttcaacata gagaaaggtcggcttgtgcaaacccaaagactaaagattatggaatattatgagaagaaa gagaaacagattgagcagcagaagaaaattcagatgtccaatttgatgaatcaagcgaga ctcaaagtcctcagagcaagagatgaccttatcacagacctactaaatgaagcaaaacag agactcagcaaggtggtaaaagatacaaccaggtaccaagtgctgctggatggactggtt ctccagtttgcattttctactgactgtatctactatttcttccagggtttgtaccagttg ctggagccccgaatgattgttcgttgcaggaaacaagatttccctctggtaaaggctgca gtgcagaaggcaattcctatgtacaaaattgccaccaaaaacgatgttgatgtccaaatt gaccaggagtcctacctgcctgaagacatagctggtggagttgagatctataatggagat cgtaaaataaaggtttccaacaccctggaaagccggctggatctcatagcccagcagatg atgccagaagtccggggagccttgtttggtgcaaatgccaacaggaagtttttggactaa >gi568815576r:17492677_17719528|GENSCAN_predicted_peptide_3|96_aa MGFHHLGQAGLELLTLRDERGGGGSCFRQGPPAAVSAPTRGPNLPPPPCWRVRSPRRAFV TAGVGAGPELPDIRNGIRSGRTDQRRPRRSLRPDVT >gi568815576r:17492677_17719528|GENSCAN_predicted_CDS_3|291_bp atggggtttcaccatcttggccaggctggtcttgaacttctgaccttaagagatgaaaga ggcggcggcggcagttgcttccgacagggtcccccagcggcggtgagtgctccgacccgc ggccctaatctaccgccgccgccatgttggagggtgaggtcaccccggcgtgccttcgtc acggccggcgtcggggcgggacctgaactacctgacatccggaacggtatccggagcgga aggaccgaccaacggaggccccggaggtctctccgtccagatgtgacctga >gi568815576r:17492677_17719528|GENSCAN_predicted_peptide_4|139_aa MKIGTGSGALLKHTRKCEATLEMGNKQAEVGTVWKAQETTGKMWESLELPRDLLNGFDQN ADSDMGDKVQVEVVSDGDKELIGNWNKGDSCYVLAKRLVAFCPCPRDLWNFKPETDDLGY LAEEISEQQSFKRKQSIKV >gi568815576r:17492677_17719528|GENSCAN_predicted_CDS_4|420_bp atgaaaattggtactgggagtggggcactgcttaaacatacccgaaaatgtgaagcaact ttggaaatgggtaacaaacaggcagaagttggaacagtgtggaaggctcaggaaacaaca ggaaaaatgtgggagagtctggaacttcctagagacttattgaatggttttgaccaaaat gctgacagtgatatgggcgataaagtccaagttgaggtggtctcagatggagataaggaa cttattgggaactggaacaaaggtgactcttgctatgttttagcaaagagactggtggcg ttttgcccctgccctagagatctgtggaactttaaacctgagacagatgatttagggtat ctggcagaagaaatctctgagcaacaaagcttcaagaggaagcaaagcataaaagtttga 
>gi568815576r:17492677_17719528|GENSCAN_predicted_peptide_5|195_aa MHDSRGVQLDIASQSLDQEILLKVKTEIEEELKSLDKEISEAFTSTGFDRHTSPVFSPAN PESSMEDCLAHLGEKVSQELKEPLHKALQMLLSQPVTYQAFRECTLETTVHASGWNKILV PLVLLRQMLLELTRRGQEPLSALLQFGVTYLEDYSAEYIIQQGGWQGVTDKSYEFRSNKK DSDGQNWESILDFLD >gi568815576r:17492677_17719528|GENSCAN_predicted_CDS_5|588_bp atgcatgattcacgaggggttcaactagatatagcttcacaatctctggatcaagaaatt ttattaaaagttaaaactgaaattgaagaagagctaaaatctctggacaaagaaatttct gaagccttcaccagcacaggctttgaccgtcacacttctccagtgttcagccctgccaat ccagaaagctcaatggaagactgcttggcccatcttggagaaaaagtgtcccaggaactg aaagagcctctccataaagcattgcaaatgctcctgagccagccagtgacatatcaggca tttcgggaatgtacactggagaccacagttcatgccagcggctggaataagattttggtg cctctggttttgctacgacaaatgcttttggaattgacaagacgtggtcaagaacctttg agcgcactgctgcagtttggcgtgacatacctggaggactattcggcagagtacatcatt cagcaaggtggctggcaaggagtaacagataaatcctatgagtttagaagtaacaagaag gattcagatgggcagaattgggagagcattctggactttctggattga