GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:38:03 Sequence gi568815597r:45229309_45434512 : 205204 bp : 43.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 13644 13683 40 -1.16 1.01 Sngl + 56283 56576 294 2 0 88 54 270 0.984 19.30 1.02 PlyA + 57162 57167 6 1.05 2.00 Prom + 57697 57736 40 -4.96 2.01 Sngl + 57790 58692 903 0 0 49 49 247 0.992 13.12 2.02 PlyA + 59100 59105 6 1.05 3.00 Prom + 60919 60958 40 -2.46 3.01 Init + 64395 64472 78 2 0 69 72 13 0.611 -1.14 3.02 Intr + 66755 66853 99 1 0 100 60 87 0.794 7.41 3.03 Term + 67677 67847 171 2 0 -14 45 136 0.385 -3.07 3.04 PlyA + 69954 69959 6 1.05 4.03 PlyA - 70699 70694 6 1.05 4.02 Term - 75795 75698 98 1 2 72 44 123 0.083 4.43 4.01 Init - 89654 89591 64 2 1 47 111 62 0.419 5.72 4.00 Prom - 94513 94474 40 -4.36 5.00 Prom + 96943 96982 40 -6.26 5.01 Sngl + 97841 98956 1116 1 0 106 49 1075 0.999 101.18 5.02 PlyA + 99176 99181 6 1.05 6.11 PlyA - 101032 101027 6 -0.45 6.10 Term - 102026 101817 210 2 0 85 42 164 0.942 8.79 6.09 Intr - 102248 102112 137 0 2 125 107 114 0.982 17.29 6.08 Intr - 102541 102353 189 2 0 39 82 129 0.989 6.96 6.07 Intr - 102778 102715 64 1 1 123 77 44 0.998 5.09 6.06 Intr - 103002 102858 145 2 1 69 63 124 0.996 8.28 6.05 Intr - 103180 103083 98 1 2 80 62 142 0.984 9.61 6.04 Intr - 103379 103266 114 2 0 57 58 159 0.999 10.44 6.03 Intr - 103526 103455 72 2 0 90 116 52 0.995 7.80 6.02 Intr - 103862 103789 74 0 2 70 62 88 0.917 3.53 6.01 Init - 105197 105083 115 2 1 100 91 72 0.910 9.08 6.00 Prom - 108700 108661 40 -6.46 7.00 Prom + 108871 108910 40 -9.75 7.01 Init + 110945 110996 52 1 1 84 101 79 0.851 10.13 7.02 Intr + 111765 112035 271 1 1 85 100 189 0.818 16.40 7.03 Intr + 112165 112261 97 1 1 22 86 113 0.909 4.51 7.04 Intr + 112641 112799 159 2 0 83 102 178 0.999 18.88 7.05 Intr + 113076 113335 260 2 2 101 77 318 0.997 28.26 7.06 Intr + 113535 113694 160 0 1 78 89 162 0.998 15.29 7.07 Term + 113774 114394 621 1 0 91 44 308 0.992 21.12 7.08 PlyA + 114630 114635 6 -0.45 8.10 PlyA - 114692 114687 6 -3.24 8.09 Term - 116250 115532 719 1 2 61 55 402 0.985 27.85 8.08 Intr - 116686 116569 118 1 1 36 78 124 0.994 6.24 8.07 Intr - 117471 117385 87 0 0 80 92 63 0.977 6.07 8.06 Intr - 117754 117671 84 1 0 -1 83 137 0.913 4.32 8.05 Intr - 118385 118301 85 1 1 63 115 50 0.550 5.02 8.04 Intr - 118692 118610 83 0 2 49 89 84 0.542 3.04 8.03 Intr - 126141 125995 147 0 0 79 86 96 0.817 8.93 8.02 Intr - 156652 156604 49 0 1 116 94 -3 0.001 1.78 8.01 Intr - 192538 192417 122 1 2 105 63 80 0.123 6.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 145292 145423 132 1 0 87 49 108 0.863 4.89 S.002 Term + 185879 186036 158 0 2 48 38 175 0.942 6.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_1|97_aa MGKKQNRKTGNSKKQSASPPPKEYSSSPAKEQSWMENDFDELREEGFRRSNYSELKEEVR TQGKEVKNLLKKLDKWLTRITNAEKSLKDLMELKTMA >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_1|294_bp atggggaaaaaacagaacagaaaaaccggaaactctaaaaagcagagcgcctctcctcct ccaaaggaatatagctcctcaccagcaaaggaacaaagctggatggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga acccaaggcaaagaagttaaaaaccttttaaaaaaattagacaaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatggcatga >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_2|300_aa MGDFYTPLSTSDRSTKQKVNKDIQELNSALHQADLIDIYRILHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELSIKKLTQILSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKRKQEKSKIDTLTSQLK EPEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKISESRSWFFEKINKIDRLLARLIK KKREKNQIDTIKNDKGDTTTNPTEIQTTIREYYKHLCANKLENLEEMDKFLDIYTLPRLN >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_2|903_bp atgggagacttttacaccccactgtcaacatcagacagatcaacgaaacagaaagttaac aaggatatccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga attctccaccccaaatcaacagaatatacattcttctcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcagcattaagaaactc actcaaatcctctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaaaatctaaaatagacaccctaacatcacaattaaaa gaaccagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcagtgaatcc aggagctggttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaag aagaaaagagagaagaatcaaatagacacaataaaaaatgacaaaggggataccaccacc aatcccacagaaatacaaactaccatcagagaatactataaacacctctgcgcaaataaa ctagaaaatctagaagaaatggataaattcctcgatatatacaccctcccaagactaaac tag >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_3|115_aa MERSGCFQEPCKRENWQHLVTGHGYRVLAKQLQQFQDTFRLNNIPGKKRYFCLHHSHLGV GIEAESMDELTQRVFRGRSTEDLGASVELEIQLNWKATEADTESVTAPAAEGLPV >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_3|348_bp atggaaagaagtggatgctttcaagagccatgtaagagagaaaattggcagcacttggtg actggacacggctacagggtgttagcaaaacagctgcagcagttccaggacaccttcaga cttaacaacattcccgggaaaaagagatacttctgtttacatcactctcacctaggggtg ggaattgaagctgaaagcatggatgagctcacccagagagtgttcagagggagaagcaca gaggacctaggcgcttcagttgagctggaaatccagctcaactggaaggctacagaggca gacaccgaatcagttacggctcctgctgcagaagggcttcctgtctga >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_4|53_aa MRKGLTEVMGAVPEALWQYSPVTPLLMMNRMLLNPGVAGALNDFDDERLGMES >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_4|162_bp atgagaaagggcttgactgaagtaatgggggctgtccctgaagccttgtggcagtacagc ccagtcacccctctgctcatgatgaacaggatgctgctgaaccctggggtcgctggggcc ttaaatgactttgatgatgagagactgggcatggagagctag >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_5|371_aa MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVDGWRQLALRSGDAVFLVNEGA GSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGAATYAV VSSPAGILSLTLLERAGYRGPFLPGFRPVSSAPGPGWVSRVDHLTLACTPGSSPTLLRWF HDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLVLAESLPGATTR QDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQQPGKERQIRAA GHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGATGFGQGNIRALWQS VQEQSARSQEA >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_5|1116_bp atggccgcgcccgcccttcgtttgtgccacatcgccttccacgtgcccgccgggcagccc ctagcccggaacctgcagcgcctcttcggcttccagcccctggcttcgcgggaggtggac ggctggcggcagctagccctgcgcagcggcgacgcggtctttttggtgaacgagggcgca gggtctggagagccgctgtacggcctggatccgcgtcacgccgtgcccagcgccacaaac ctgtgcttcgacgtggcggacgccggcgctgcaacccgggagctggcagcgctgggctgc agcgtgcctgtccctcccgttcgcgtgcgggacgcgcagggtgccgccacttacgccgtg gtcagctcgcctgccggcatcctcagcctgaccttgctggagcgcgctggctaccgcgga cccttcctacccggcttcaggcccgtgtcctctgcgcctggccccgggtgggtcagccgc gtggaccacctgaccttggcctgcacccccggcagctcccccacacttttgcgctggttc cacgactgcctgggcttttgccacttgccgctgagcccaggtgaggatcccgagctgggc ctcgaaatgacagcagggtttgggcttgggggactgaggcttacagccctgcaggcccag ccgggcagcattgtccccactcttgttctggctgagtcccttccgggggcgacgacacga caggaccaggtggagcagttcctggcccggcacaaggggccaggcctgcagcacgtgggg ctgtatacgcctaacattgtggaggccactgagggggtggcaactgctggaggccagttc ctggctccccctggggcatactaccagcagccaggaaaggagaggcagatccgagctgca gggcacgagcctcatctgcttgctcgacaggggatcctgctagatggtgataaaggcaag tttctgcttcaggtcttcaccaagtccctttttactgaggacactttcttcctggagctg attcagaggcagggggccactggctttggtcagggcaacatcagagctctgtggcagtcc gtacaggagcaatctgccaggagccaggaagcctaa >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_6|405_aa MRKPRAAVGSGHRKQAASQEGRQKHAKNNSQAKPSACDVWVSEVMLQQTQVATVINYYTG WMQEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTAGAIAS IAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLVDPARPGDFNQAAMEL GATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVEECAPNTGQCHLCLPP SEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFP SVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQTP VTTVPPGARWLTQEEFHTAAVSTAMKKALPLLSLLYFLVFPTCST >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_6|1218_bp atgaggaagccacgagcagccgtgggaagtggtcacaggaagcaggcagccagccaggaa gggaggcagaagcatgctaagaacaacagtcaggccaagccttctgcctgtgatgtgtgg gtctcagaggtcatgctgcagcagacccaggttgccactgtgatcaactactataccgga tggatgcaggaggtgaatcaactctgggctggcctgggctactattctcgtggccggcgg ctgcaggagggagctcggaaggtggtagaggagctagggggccacatgccacgtacagca gagaccctgcagcagctcctgcctggcgtggggcgctacacagctggggccattgcctct atcgcctttggccaggcaaccggtgtggtggatggcaacgtagcacgggtgctgtgccgt gtccgagccattggtgctgatcccagcagcacccttgtttcccagcagctctggggtcta gcccagcagctggtggacccagcccggccaggagatttcaaccaagcagccatggagcta ggggccacagtgtgtaccccacagcgcccactgtgcagccagtgccctgtggagagcctg tgccgggcacgccagagagtggagcaggaacagctcttagcctcagggagcctgtcgggc agtcctgacgtggaggagtgtgctcccaacactggacagtgccacctgtgcctgcctccc tcggagccctgggaccagaccctgggagtggtcaacttccccagaaaggccagccgcaag ccccccagggaggagagctctgccacctgtgttctggaacagcctggggcccttggggcc caaattctgctggtgcagaggcccaactcaggtctgctggcaggactgtgggagttcccg tccgtgacctgggagccctcagagcagcttcagcgcaaggccctgctgcaggaactacag cgttgggctgggcccctcccagccacgcacctccggcaccttggggaggttgtccacacc ttctctcacatcaagctgacatatcaagtatatgggctggccttggaagggcagacccca gtgaccaccgtaccaccaggtgctcgctggctgacgcaggaggaatttcacaccgcagct gtttccaccgccatgaaaaaggcactacctttgttgtctttgttgtacttccttgtgttt cctacatgttctacatga >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_7|539_aa MAADSDDGAVSAPAASDGGVSKSTTSGEELVVQVPVVDVQSNNFKEMWPSLLLAIKTANF VAVDTVRVGKQGGQVVVKGLVLEACHNSPFTYPQELSGLGDRKSLLNQCIEERYKAVCHA ARTRSILSLGLACFKRQPDKGEHSYLAQVFNLTLLCMEEYVIEPKSVQFLIQHGFNFNQQ YAQGIPYHKGNDKGDESQSQSVRTLFLELIRARRPLVLHNGLIDLVFLYQNFYAHLPESL GTFTADLCEMFPAGIYDTKYAAEFHARFVASYLEYAFRKCERENGKQRAAGSPHLTLEFC NYPSSMRDHIDYRCCLPPATHRPHPTSICDNFSAYGWCPLGPQCPQSHDIDLIIDTDEAA AEDKRRRRRRREKRKRALLNLPGTQTSGEAKDGPPKKQVCGDSIKPEETEQEVAADETRN LPHSKQGNKNDLEMGIKAARPEIADRATSEVPGSQASPNPVPGDGLHRAGFDAFMTGYVM AYVEVSQGPQPCSSGPWLPECHNKVYLSGKAVPLTVAKSQFSRSSKAHNQKMKLTWGSS >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_7|1620_bp atggccgccgacagtgacgatggcgcagtttcagctcccgcagcttccgacggtggtgtc agcaaaagcacaacatctggggaggagctagtagtccaggttcccgtagtggatgtgcaa agcaacaacttcaaggagatgtggccatccctcctgctagccataaagacagctaatttc gtggctgtggacacggtgagagttgggaaacaaggagggcaggtggttgtgaaggggctg gtgctagaggcctgtcacaactctccctttacctacccacaggagctgagtgggcttggg gacaggaagagtttgctgaaccagtgcattgaggaacgttacaaggccgtgtgtcatgct gccaggacccgttctatcctttccctgggcctcgcctgcttcaagcggcagccagacaag ggtgaacattcctatctggctcaagtgttcaatctcactctgctgtgcatggaggagtat gtcatagaaccaaagtctgtgcagttcctgatacagcatggcttcaacttcaaccagcag tatgcccaaggcatcccctaccataagggcaatgacaagggtgatgagagccagagccag tcagtacggaccctattcctggagctaatccgagcccgccggcccctggtgctacacaat ggccttatagacttggtgttcctgtaccagaacttctatgcacacctccctgagagtctg ggaaccttcaccgctgacctgtgtgagatgttcccagcaggcatttatgacaccaaatat gctgctgagtttcatgcccgtttcgtggcctcctacttagaatatgccttccggaaatgt gaacgggaaaatgggaagcagcgggcagctggcagcccacaccttaccctggagttctgc aactatccttccagcatgagggaccatattgattaccgctgctgcctgcccccagcaacc caccgtcctcatcccaccagcatctgtgacaacttctcggcttatggctggtgccccctg ggaccacagtgtcctcagtctcacgatattgaccttatcattgacactgatgaggctgcg gcagaggacaagcggcgacggcgacgacgtagggaaaaacggaagagggctttattgaac ctaccggggacacagacctctggggaagctaaggatggtcctcccaagaagcaggtctgt ggggatagcatcaagcctgaagaaaccgagcaggaggtggctgccgatgaaactaggaac ctgcctcactccaagcaaggcaacaaaaatgacttagagatggggattaaggcagcaagg cctgaaatagctgatagagctacctcagaagtgccagggagccaagccagtcctaaccca gtgcctggggatggattgcaccgggctggttttgatgcctttatgacaggttatgtgatg gcctatgtggaagtgagccagggaccgcagccctgcagctctggaccctggctccctgaa tgccacaataaggtatatttgagtggcaaagctgtacccctcacagtggccaagagccag ttctctcgttcctccaaagcccacaatcagaagatgaagctcacttggggcagtagctga >gi568815597r:45229309_45434512|GENSCAN_predicted_peptide_8|497_aa VRHRASGQVMALKMNTLSSNRANMLKEVQLMNRLSHPNILRFMGVCVHQGQLHALTEYIN SGNLEQLLDSNLHLPWTVRVKLAYDIAVGLSYLHFKGIFHRDLTSKNCLIKRDENGYSAV VADFGLAEKIPDVSMGSEKLAVVGSPFWMAPEVLRDEPYNEKADVFSYGIILCEIIARIQ ADPDYLPRTENFGLDYDAFQHMVGDCPPDFLQLTFNCCNMDPKLRPSFVEIGKTLEEILS RLQEEEQERDRKLQPTARGLLEKAPGVKRLSSLDDKIPHKSPCPRRTIWLSRSQSDIFSR KPPRTVSVLDPYYRPRDGAARTPKVNPFSARQDLMGGKIKFFDLPSKSVISLVFDLDAPG PGTMPLADWQEPLAPPIRRWRSLPGSPEFLHQEACPFVGREESLSDGPPPRLSSLKYRVK EIPPFRASALPAAQAHEAMDCSILQEENGFGSRPQGTSPCPAGASEEMEVEERPAGSTPA TFSTSGIGLQTQGKQDG >gi568815597r:45229309_45434512|GENSCAN_predicted_CDS_8|1494_bp gtacgacaccgagcttctggtcaggtgatggctcttaagatgaacacattgagcagtaac cgggcaaacatgctgaaagaagtacagctcatgaatagactctcccatcccaacatcctt aggttcatgggtgtatgtgttcatcaaggacaattgcatgcacttacagagtatatcaac tccgggaacctggaacagttgctagacagtaacctgcatttgccttggactgtgagggta aaactggcctatgacatagcagtgggcctcagctaccttcacttcaaaggcatttttcat cgggacctcacatctaagaactgcctgataaagagggatgagaatggttactctgcagtg gtagctgactttggcctggctgagaagatccccgatgtcagcatggggagtgagaagctg gccgtggtgggttccccattctggatggcacctgaggttctccgagatgagccctataat gaaaaggcagatgtgttctcttatggtatcatcctctgcgagatcatcgcccgcatccag gccgatccggactatcttccccgcacagagaatttcgggctggactatgatgctttccag cacatggtgggagactgtcccccagattttctgcaacttactttcaactgctgtaacatg gatcccaaactgcgcccatcttttgtggagattgggaagaccctggaggaaattctgagc cgcctacaggaagaagagcaggagagggataggaagctgcagcccacagccaggggactc ttggagaaagcacctggggtgaagcgactaagctcactggatgacaagatcccccacaag tcaccatgcccaagacgtaccatctggctgtctcgaagccagtcagatatcttttcccgt aagcccccacgtacagtgagtgtcttggacccatactaccggccacgagatggtgctgcc cgcacccccaaagtcaacccttttagtgctcgccaggacctcatggggggcaagatcaag ttttttgacctgcccagcaagtctgtcatctctctggtatttgacctggatgcaccaggg cccggaactatgcccctggctgactggcaggagcccctggccccacctattcgccggtgg cgttccttgcctggttcgcctgagttcttgcatcaagaggcttgtccatttgtgggccgg gaagaatcgctatctgatgggcccccaccacgcctaagtagtctcaagtacagagttaaa gagatcccaccattccgggcatctgccctaccagctgctcaagcccatgaggctatggac tgctccattctccaggaagaaaatggttttgggtccaggccccaggggaccagtccatgc cctgcgggtgcttctgaggagatggaggtagaagaaaggccagcaggctcaactccagcc accttctccacctcaggcataggcctgcaaacccagggaaagcaggatgggtga