GENSCAN 1.0	Date run: 4-Nov-116	Time: 13:34:55

Sequence gi568815587f:114339529_114549906 : 210378 bp : 38.66% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    665    948  284  0  2   77    9   169 0.185   4.11
 1.02 Intr +   7689   7829  141  2  0   66   60    87 0.175   3.33
 1.03 Intr +   8315   8401   87  1  0   57  115    62 0.214   5.05
 1.04 Term +   8845   9312  468  0  0   62   42   229 0.960   9.69
 1.05 PlyA +  10243  10248    6                               1.05

 2.05 PlyA -  10867  10862    6                               1.05
 2.04 Term -  24639  24580   60  0  0   92   49    52 0.561  -1.47
 2.03 Intr -  26665  26472  194  2  2  120   66   136 0.739  12.89
 2.02 Intr -  32053  31976   78  2  0   45   72   105 0.469   3.20
 2.01 Init -  37266  37182   85  0  1   19   71    98 0.284   2.23
 2.00 Prom -  37783  37744   40                              -4.55

 3.00 Prom +  43205  43244   40                              -5.75
 3.01 Init +  60028  60115   88  0  1   77   98    47 0.001   5.45
 3.02 Intr +  60489  60721  233  1  2   53    6   221 0.000   6.97
 3.03 Intr +  61126  61509  384  0  0   -6   26   266 0.000   5.12
 3.04 Intr +  62170  62332  163  0  1   88   93    76 0.770   6.73
 3.05 Intr +  63300  63387   88  1  1   64   90    16 0.509  -2.49
 3.06 Intr +  66178  66271   94  1  1   67   66    42 0.462  -1.05
 3.07 Intr +  74428  74596  169  0  1  107   70   123 0.277  11.00
 3.08 Intr +  77012  77114  103  0  1   13  105    94 0.182   1.91
 3.09 Intr +  86163  86250   88  0  1  102   15    88 0.005   1.95
 3.10 Intr +  99938 100147  210  1  0   19  105   302 0.933  23.19
 3.11 Intr + 101128 101211   84  0  0   66   62    67 0.665   1.00
 3.12 Intr + 104328 104405   78  2  0   97   87    92 0.985   8.83
 3.13 Intr + 105013 105124  112  0  1   42   98   103 0.867   5.73
 3.14 Intr + 106451 106559  109  0  1   61  110    83 0.996   6.22
 3.15 Intr + 108298 108351   54  1  0   45   97    78 0.712   1.58
 3.16 Intr + 110318 110438  121  2  1   77   84   148 0.932  12.88
 3.17 Intr + 113704 113833  130  0  1   58   52    45 0.094  -2.75
 3.18 Intr + 127329 127421   93  1  0   70   94    69 0.457   4.72
 3.19 Term + 134990 135096  107  0  2   23   44   161 0.526   2.79
 3.20 PlyA + 135169 135174    6                               1.05

 4.00 Prom + 143948 143987   40                              -2.75
 4.01 Init + 149499 149765  267  2  0   68   41   197 0.202  10.03
 4.02 Intr + 157836 157904   69  2  0   60  115    49 0.327   3.26
 4.03 Intr + 164817 164956  140  2  2   36   80   116 0.100   3.94
 4.04 Intr + 172813 172927  115  1  1   77  113    71 0.018   8.03
 4.05 Term + 173261 173383  123  1  0   64   43    61 0.054  -3.40
 4.06 PlyA + 174426 174431    6                               1.05

 5.06 PlyA - 175229 175224    6                               1.05
 5.05 Term - 182975 182440  536  0  2   65   49   392 0.979  26.42
 5.04 Intr - 183563 183351  213  0  0   74   56   137 0.920   6.86
 5.03 Intr - 188373 188312   62  2  2   80   98    58 0.977   3.46
 5.02 Intr - 191380 190647  734  1  2   65  109   340 0.611  23.10
 5.01 Init - 193961 193734  228  2  0   74   59   120 0.425   4.17
 5.00 Prom - 194117 194078   40                             -11.93

 6.00 Prom + 194369 194408   40                              -9.85
 6.01 Sngl + 194800 195255  456  0  0   88   48   442 0.999  36.13
 6.02 PlyA + 196041 196046    6                               1.05

 7.00 Prom + 196209 196248   40                              -6.15
 7.01 Init + 197478 198332  855  2  0   37   86   246 0.538  13.97
 7.02 Term + 201110 201226  117  1  0   86   48   128 0.880   6.16
 7.03 PlyA + 202470 202475    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  60803  60432  372  2  0   59   41   370 0.997  25.27
S.002 Init +  61144  61509  366  0  0   83   26   264 0.818  16.85
S.003 Sngl + 166992 167429  438  2  0   65   48   203 0.914  10.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_1|326_aa
XHVNWKSATTKNVTQTGLNVKENLLIYKPEKPKDIRLQSRLDSATQGQQRPRRLFISPLC
VTQHVFSPSGHKMAASSFQRCDVLAHIQRDENVPQESPSAVWKLGDDRSCYFLLNCGVEG
APQLRFPREGSLQCRSARMGWRDWVEVVMPPMPIPSPPSTPTPIMSGRTVQTSGLIPSSS
RGQQTGIPSPIFKYIMAPALANMSFPSKGVGLSGIIRKACEKNKVPQSQFRGCEKYLVTG
CPRTPLMVTDLEDSCLGRKSKTEKAAPVSRRKLVSWPSMVKLIWGSVRVMACSGACRRHP
QSCCWTILLVASGPEDLLPQGQCAVQ
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_1|981_bp
ntgcatgttaattggaaatctgcaactaccaaaaatgtcactcaaactggcttgaacgtt
aaagaaaatttattgatttacaaacctgaaaagcctaaagatatcaggcttcagtcaaga
cttgattcagcaactcaagggcagcaaagaccccggaggcttttcatttctcctctctgc
gttacacaacatgtattttctccaagtggccacaagatggctgcaagtagctttcagcgc
tgcgatgtccttgctcatatccaaagagacgagaatgtccctcaagaaagcccaagtgct
gtttggaagttgggcgacgatcgttcttgctacttcctgctgaactgcggggtagaaggg
gctccgcagttgaggtttcctcgggaggggagtcttcagtgtcgtagtgcaagaatgggt
tggcgggattgggtggaggtggtaatgcctccaatgccgattcctagtcctcctagtacc
ccaaccccaataatgagtggcaggacagtacagacttcaggattaattccttcctccagt
agagggcagcaaacgggtattccttctcctatattcaagtatataatggcccctgcttta
gctaatatgtccttccctagtaaaggagtagggctctcaggcataataagaaaggcatgt
gagaaaaataaggttccccagtcacaatttagagggtgtgagaaatacctagtgactggc
tgtcctaggacccctttgatggtgacagacctggaggatagttgtctgggacgaaagagt
aaaactgagaaggccgcgccagtgtccaggaggaagttagtttcctggccctcaatggtt
aagcttatctggggctctgtgagggtgatggcatgctctggcgcctgccgcaggcaccct
cagtcctgttgttggaccatcttgttagtggcctctggcccagaggaccttctccctcag
gggcagtgtgccgtccagtga
>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_2|138_aa
MTGELWTDDRTKGALKVSIAKQAVSLLISKCENQSRVLSTSPQTEKLEVEGILLGKSHCG
EDRKEHNNACHQEEEAPDVNPLHGLQASGSPGKTQRIKSQALEGTRLYSHSEETRSGLIL
PPSGNLRPTPFDLIRGRL
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_2|417_bp
atgacaggagagctttggactgatgaccgcacaaagggtgctcttaaagtgagtattgcc
aagcaagctgtcagtcttttgataagtaagtgtgagaaccagtcgagagtgttaagcacc
agtccacagactgagaaattggaagtagaaggaatactgctgggaaagtcccactgtggg
gaggataggaaagagcataacaatgcctgccaccaggaggaagaggccccagatgtcaac
ccattgcatggtttgcaggcatcgggatcaccgggaaagactcaaagaatcaagtcacag
gcactggagggaacaaggctttactcacatagcgaagagacaagatcaggcttaatactc
cctccatcaggaaacttaagaccaactccatttgatctgatcagaggacgtctataa
>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_3|835_aa
MEDGKCQGKDGRINQCNKKEHFFFANRSKAHFLGDRVQRQGLGKVRIWLGGSGPGHLVPL
GSGRHHPLDGSRPERLVAPPAEVLDESLQAWKRGSRLRRPSPTLTPAGGGDAEMGAAAAE
ADRTLFVGNLETKVTEELLFELFHQVSGWVRPFAFRFPSRLGPGQRPPRFLFVAVRGPDG
WLFGGERRVWVAKRSLGVAFLEDLGSFRPKVTTKGGGGILDQVSRGRDPGVLGNTAGPVI
KVKIPKDKDGKPKQFAFVNFKHEVSVPYAMNLLNGIKLYGRPIKIQFRSGSSHAPQDVSL
SYPQHHVGNSSPTSTSPSSRYERTMDNMTSSAQIIQRSFSSPENFQRQAVVVSSNLYTDQ
LSHEYGRKPAGLREFSPCATYRFSVLCTGHSSCLCFAGLSAVSSELTIGLHISLAQNTEK
FEEMCQQLEELLVGCCHMPTRHRSFTPLPPPLQTHEEYCQATANIHIRPKRLLRLRQRRL
RDWGRGCWSRVMLGGSLGSRLLRGVGGSHGRFGARGVREGGAAMAAGESMAQRMVWVDLE
MTGLDIEKDQIIEMACLITDSDLNILAEGPNLIIKQPDELLDSMSDWCKEHHGKSGLTKA
VKESTITLQQAEYEFLSFVRQQTPPGLCPLAGNSVHEDKKFLDKYMPQFMKHLHYRIIDV
STVKELCRRWYPEEYEFAPKKAASHRALDDISESIKELQFYRNNIFKKKIDEKKRKIIEN
GENEKTRDFVDLRTDSSNYLPIRVMQWNNFIQALGKTKTTLYNALLKHSRGRWSFGSDDV
APIAELSWRLKHVSSDPCQFLYQVRLTVIIGFSHSAVIDDFDESSSGGGMRTKPD
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_3|2508_bp
atggaagatggtaaatgccaaggaaaagatggaaggataaatcagtgtaataaaaaggag
cacttctttttcgccaacagaagtaaagcacacttcttaggagatcgggttcaacggcag
ggattgggtaaggtgagaatctggcttggcggctccggccccggccatctggttcccttg
ggctccggccgccaccatccactcgacggctctcggcccgaacgcttggtcgcaccgcct
gccgaggtcctagatgaatcgcttcaggcctggaaacgaggaagccgtctccggagacca
tcgccaacgctgacgcccgcgggagggggcgacgctgagatgggggcggcggcggcggaa
gcggatcgcactctctttgtgggcaaccttgaaacgaaagtgaccgaggagctccttttc
gagcttttccaccaggtaagcggctgggttcggccctttgcctttcgttttccgtctcgc
ctagggcctggccagcggccaccccgttttcttttcgtagccgtcaggggacccgacggg
tggctgtttgggggtgaaaggcgggtctgggttgcgaaacgctcgctgggtgtcgctttc
ctggaagatcttggttcgtttaggccgaaagtgacgactaaaggtggtggagggatcctc
gatcaggtttcccgtggtagagatccaggggtccttgggaacacagctgggccagtaata
aaggtgaaaattccaaaagataaggatggtaaaccaaagcagtttgcgtttgtgaatttc
aaacatgaagtgtctgttccttatgcaatgaatctacttaatggaatcaaactttatgga
aggcctatcaaaattcaatttagatcaggaagtagtcatgccccacaagatgtcagtttg
tcatatccccaacatcatgttggaaattcaagccctacctccacatctcctagcagcagg
tacgaaaggactatggataacatgacttcatcagcacagataattcagagatctttctct
tctccagaaaattttcagagacaagcagtggtagtttcttcaaacctgtatactgatcag
ctgtctcatgaatacgggagaaaacctgcaggtctccgagagttctctccctgtgcaact
tatcgcttttcagtgctgtgtaccggccattccagctgcctttgtttcgctggactctca
gctgtgtcttctgaactcactatagggctgcacatatctttggcccagaacacagagaag
tttgaggagatgtgtcagcagctggaggagctgctggttgggtgctgccacatgccaaca
aggcacaggagcttcaccccactgccaccaccactacagacccatgaggagtactgccag
gctaccgccaatattcacataaggcccaagcgactattgcgcctgcgccagcgccggctg
cgagactggggccgtggctgctggtcccgggtgatgctaggcggctccctgggctccagg
ctgttgcggggtgtaggtgggagtcacggacggttcggggcccgaggtgtccgcgaaggt
ggcgcagccatggcggcaggggagagcatggctcagcggatggtctgggtggacctggag
atgacaggattggacattgagaaggaccagattattgagatggcctgtctgataactgac
tctgatctcaacattttggctgaaggtcctaacctgattataaaacaaccagatgagttg
ctggacagcatgtcagattggtgtaaggagcatcacgggaagtctggccttaccaaggca
gtgaaggagagtacaattacattgcagcaggcagagtatgaatttctgtcctttgtacga
cagcagactcctccagggctctgtccacttgcaggaaattcagttcatgaagataagaag
tttcttgacaaatacatgccccagttcatgaaacatcttcattatagaataattgatgtg
agcactgttaaagaactgtgcagacgctggtatccagaagaatatgaatttgcaccaaag
aaggctgcttctcatagggcacttgatgacattagtgaaagcatcaaagagcttcagttt
taccgaaataacatcttcaagaaaaaaatagatgaaaagaagaggaaaattatagaaaat
ggggaaaatgagaagaccagagattttgtggatctgaggacagactccagtaactaccta
cctattagagtcatgcaatggaacaacttcatccaagctcttggaaagacaaagacaact
ttgtacaatgccctgttgaaacactcaaggggaagatggtcatttggcagtgatgatgtg
gccccgattgcagaattgagttggaggctgaagcatgtgagctctgatccttgccagttc
ctgtatcaagtaagacttacagttatcattggatttagccattcagcagtgattgatgac
tttgatgagagcagttctggtggagggatgcggacaaagcctgattga
>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_4|237_aa
MTTDPPEIQITIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIE
AIINSLPTKKSPGPDGFTAEFYQRYKEELAFAFTHHSLTQSEQLPVLQAPFMRRHPDLCH
PKPMPPPVQQHTQSPAGPPHTLTNCLASIAVVNAHQGSSHVVFCQFCELFQYMSHEASSG
SVRVQQEMDVSVKLGSHSHGPGPLREHVEASSSRRLLGYAGTGTIQHLSLKPERNQY
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_4|714_bp
atgaccactgatcctccagaaatacaaattaccatcagagaatactataaacacctctat
gcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacaccctccca
agactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaattgag
gcaataattaatagcttaccaaccaaaaaaagtccgggaccagatggattcacagccgaa
ttctaccagaggtacaaagaggagctggccttcgcatttactcaccactcactgactcag
tcagagcaacttccagtcctgcaagctccatttatgagaaggcatccagacctgtgccac
cccaagccaatgccacctccagtgcaacagcacacacagtctccagcagggcccccccac
accctcaccaactgccttgcctctatcgctgtggtgaacgcccaccagggaagcagccat
gttgtcttctgtcagttttgtgaactctttcagtacatgtcacatgaagcttcctctggg
tcagtcagggtccagcaggaaatggatgttagtgtcaaacttggcagccactcacatggg
ccagggcctctcagggaacacgttgaggcctcaagcagtagaaggcttcttgggtatgca
ggtactggcacaattcagcacttgagtttgaaaccagagcgcaatcagtattga
>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_5|590_aa
MPRPASAHARCAASTVLHPLSGTPHEMNQVPQLEIQKSPVFCITHAGNCRLELFLFCHLG
STPLECIVYNLNAFEFLWSALNLSISVHYWNNSAKSLFPKTSLIPLKPLTETELRIKEII
EKLDQQIPPRPFTHVNTTTSATHSTATILNPRDTYCRGDQLDILLEVRDHLGQRKQYGGD
FLRARMSSPALTAGASGKVMDFNNGTYLVSFTLFWEGQVSLSLLLIHPSEGASALWRARN
QGYDKIIFKGKFVNGTSHVFTECGLTLNSNAELCEYLDDRDQEAFYCMKPQHMPCEALTY
MTTRNREVSYLTDKENSLFHRSKVGVEMMKDRKHIDVTNCNKREKIEETCQVGMKPPVPG
GYTLQGKWITTFCNQVQLDTIKINGCLKGKLIYLLGDSTLRQWIYYFPKVVKTLKFFDLH
ETGIFKKHLLLDAERHTQIQWKKHSYPFVTFQLYSLIDHDYIPREIDRLSGDKNTAIVIT
FGQHFRPFPIDIFIRRAIGVQKAIERLFLRSPATKVIIKTENIREMHIETERFGDFHGYI
HYLIMKDIFKDLNVGIIDAWDMTIAYGTDTIHPPDHVIGNQINMFLNYIC
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_5|1773_bp
atgcctcgccctgcttcagctcatgcacggtgtgctgcatccactgtcctgcacccactg
tccggcactccccatgagatgaaccaagtacctcagttggaaattcagaaatcacccgtc
ttctgcatcactcatgctgggaactgtagactggagctgttcttattctgccatcttggc
tccaccccgctcgaatgtattgtctataatttaaatgcttttgagtttctttggtctgct
ctaaacttatccatctctgtccattactggaacaactccgcaaagtccttattccctaaa
acatcactgataccattaaagccactaacagagactgaactcagaataaaggaaatcata
gagaaactagatcagcagatcccacccagacctttcacccatgtgaacaccaccaccagt
gccacacacagcacagccaccatcctcaaccctcgagatacatactgcaggggagaccag
ctggacatcctactggaggtgagggaccacttgggacagaggaagcaatatggtggggat
ttcctgagggccaggatgtcctccccagcactgacggcaggtgcttcaggaaaggtgatg
gacttcaacaatggcacctacctggtcagcttcactctgttctgggagggccaggtctcc
ctgtctctgctgctcatccaccccagtgaaggggcgtcggctctctggagggcaaggaac
caaggctatgataaaattattttcaaaggcaaatttgttaatggcacctctcatgtcttc
actgaatgtggcctgaccctaaactcaaatgctgaactctgtgaatatctggatgacaga
gaccaagaagccttctattgtatgaagcctcaacacatgccctgtgaggctctgacctac
atgaccacccggaatagagaggtatcttatcttacagacaaggaaaacagccttttccac
aggtccaaagtgggagttgaaatgatgaaggatcgtaaacacattgatgtcactaattgt
aacaagagagaaaaaatagaagagacatgccaagttggaatgaagcctcctgtccctggt
ggttatactttacaaggaaaatggataacaacattttgcaaccaggttcagttagacaca
attaagataaatggctgtttgaaaggcaaactcatttacctcctgggagactctacacta
cgtcagtggatctactacttccccaaagttgtaaaaacactgaagttttttgatcttcat
gaaactggaatctttaagaaacatttgcttctggatgcagaaagacacactcagattcaa
tggaaaaaacatagctatcccttcgtcactttccagctctactctctgatagatcatgat
tatatccctcgggaaattgaccggctatcaggtgacaaaaacacagccatcgtcatcacc
tttggccagcactttagaccatttcccattgacatttttattcgcagggccatcggtgtt
caaaaggctattgaaagactgttcctaagaagcccagccactaaagtgattattaagaca
gaaaacatcagggagatgcacatagagacagagaggtttggagacttccatggttatatt
cactatcttatcatgaaggatattttcaaagacctcaacgtgggcatcattgatgcctgg
gacatgaccattgcatatggcactgacactatccacccacctgatcatgtgattggaaat
cagattaacatgttcttaaactacatttgctaa
>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_6|151_aa
MGKKQSRKTGNSKNQSASSPPKERSSSPATEQSWTENDFDELREEGFRRSNYSELKEEVR
INGKEVKNFEKKLDEWITRIINAEKSLKDLMELKTTARELRDKCTSFSNRCDQLEERVSV
MEDEMNEMKHEEKFGEKRIKRNKASKKYGTM
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_6|456_bp
atggggaaaaaacagagcagaaaaaccggaaactctaaaaatcagagcgcctcttctcct
ccaaaggaacgcagctcctcaccagcaacagaacaaagctggacggagaatgactttgac
gagttgagagaggaaggcttcagaagatcaaactactctgagctaaaggaggaagttcga
atcaatggcaaagaagttaaaaactttgaaaaaaaattagatgaatggataactagaata
atcaatgcagagaagtccttaaaggacctgatggagctgaaaaccacagcacgagagcta
cgtgacaaatgcacaagcttcagtaaccgatgtgatcaactggaagaaagggtatcagtg
atggaagacgaaatgaatgaaatgaagcatgaagagaagtttggagaaaaaagaataaaa
agaaacaaagcatccaagaaatatgggactatgtga
>gi568815587f:114339529_114549906|GENSCAN_predicted_peptide_7|323_aa
MNIDAKIRNKILANRIQQHIKKLIHRDQMVFNPGMQGWFNICKSINVIQHINRTKDKNQM
IISIDAERAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDNPTANIILNGQKLGAFPLKT
GTRQGCPLSTLLFNIGLEVLAKKMRQEKGIKGIQLGKEEVRLSLFADDMIVYLENPIVSA
PNLLKLISNFSKVSGYKINVQKSRAFLYTSNRQTESQIMSELPFTIASKRIKYLGIQLTR
DVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRISIMKMAILPKIAMFNSERNAKNKYL
KAKDSEQKQASFGGKMKLGRRCH
>gi568815587f:114339529_114549906|GENSCAN_predicted_CDS_7|972_bp
atgaacattgatgcaaaaatccgcaataaaatactggcaaaccgaatccagcaacacatc
aaaaagcttatccaccgtgatcaaatggtcttcaaccctgggatgcaaggctggttcaac
atatgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccagatg
attatctcaatagatgcagaaagggcctttgacaaaattcaacaacccttcatgctaaaa
actctcaataaattaggtattgatgggacatatctcaaaataataagagctatctatgac
aaccccacagccaatatcatactgaatggacaaaaactgggagcattccctttgaaaact
ggcacaagacagggatgccctctatcaacactcctattcaacatagggttggaagttctg
gccaagaaaatgaggcaggagaaggggataaagggcattcaattaggaaaagaggaagtc
agattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctcagcc
ccaaatctcctcaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaaatcacgagcattcttatacaccagtaacagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg
gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat
acaaacaaatggaagaacattccatgctcatgggtaggaagaatcagtatcatgaaaatg
gccatactgcccaagattgcaatgttcaactctgaacgaaatgccaaaaacaagtacctg
aaggctaaagacagtgaacaaaagcaggcaagttttggtgggaagatgaaacttggaaga
aggtgccattaa