GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:11:24 Sequence gi568815597r:90612315_90817195 : 204881 bp : 39.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 123 118 6 1.05 1.05 Term - 6847 6620 228 1 0 68 48 221 0.684 11.75 1.04 Intr - 21984 21825 160 2 1 77 27 147 0.013 6.57 1.03 Intr - 45656 45505 152 2 2 141 4 130 0.003 7.94 1.02 Intr - 61851 61751 101 1 2 102 73 29 0.018 1.71 1.01 Init - 85311 85224 88 0 1 39 83 16 0.172 -2.94 1.00 Prom - 86748 86709 40 -3.15 2.13 PlyA - 86900 86895 6 1.05 2.12 Term - 94786 94359 428 2 2 83 48 180 0.901 7.98 2.11 Intr - 98847 98707 141 1 0 131 50 165 0.926 16.40 2.10 Intr - 99791 99721 71 1 2 90 100 53 0.907 4.61 2.09 Intr - 100310 100002 309 1 0 94 24 354 0.960 24.10 2.08 Intr - 102442 102217 226 2 1 58 110 333 0.878 28.62 2.07 Intr - 104883 104257 627 1 0 -29 113 589 0.446 40.95 2.06 Intr - 105605 105209 397 2 1 -1 -37 535 0.797 25.33 2.05 Intr - 114597 114278 320 1 2 25 21 228 0.156 4.55 2.04 Intr - 115154 115008 147 0 0 24 82 103 0.644 2.59 2.03 Intr - 117701 117510 192 0 0 53 86 114 0.516 6.24 2.02 Intr - 117898 117808 91 1 1 86 74 68 0.350 3.85 2.01 Init - 120037 119972 66 1 0 63 55 75 0.434 2.82 2.00 Prom - 120361 120322 40 -6.45 3.00 Prom + 124996 125035 40 -7.05 3.01 Init + 128600 128765 166 1 1 71 -15 199 0.177 7.74 3.02 Term + 131866 131951 86 2 2 114 43 60 0.135 0.94 3.03 PlyA + 133947 133952 6 1.05 4.04 PlyA - 134235 134230 6 1.05 4.03 Term - 136547 136444 104 0 2 93 34 87 0.681 1.26 4.02 Intr - 146263 146175 89 0 2 97 110 29 0.625 4.70 4.01 Init - 153081 152900 182 0 2 41 69 136 0.654 5.70 4.00 Prom - 154232 154193 40 -3.05 5.00 Prom + 160175 160214 40 -3.65 5.01 Sngl + 169330 169599 270 0 0 55 39 198 0.589 6.66 5.02 PlyA + 171957 171962 6 1.05 6.04 PlyA - 172884 172879 6 1.05 6.03 Term - 177173 177123 51 2 0 103 54 83 0.594 2.85 6.02 Intr - 178379 178270 110 0 2 72 93 73 0.343 5.28 6.01 Init - 197953 197830 124 1 1 79 80 153 0.984 12.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 45656 45474 183 2 0 141 39 158 0.916 12.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:90612315_90817195|GENSCAN_predicted_peptide_1|242_aa MCLFNKNGSEGRCQELTPLFIGSYILNCIRSASGNPTVMGRILRWPSTFAGICTYRIPSS RALMGTTQTVSSLSYSPREEKNDHQWPMAVAAKPHQEPASCSGENAALPAHGFRCSCSGQ FHTSSHSPCADNATLHMYAVRLPAFFPRAFFDATKACTIPMQVQPNGAMEIMLDKKQIRA IFLFEFKMGCKAAESTHNNNTFGPGTANKRTVQWWFKKFCKGDESLEDEEHCGQPSEVDN DH >gi568815597r:90612315_90817195|GENSCAN_predicted_CDS_1|729_bp atgtgtctctttaataagaatggaagtgaaggcagatgtcaggaactgactccacttttt attggatcctatattctaaattgtattagatctgcttctggaaaccccacagtgatgggt agaattctaagatggccttcaacattcgctggcatatgcacctatagaattccctcctct cgagcattgatgggaaccactcagactgtgtcctctctctcttactctcccagagaagag aagaacgatcatcaatggccaatggcagttgcagcgaagccacaccaagagccggcttca tgctcaggagagaacgctgcacttcctgctcatggtttccgctgtagctgcagtggacag ttccacacaagttcacactcaccttgtgctgacaatgccacacttcacatgtatgctgta cgtcttcctgctttcttccccagggctttctttgatgccacaaaagcctgcacaattccc atgcaagtgcagcccaatggggctatggaaataatgttagacaaaaagcaaattcgagca atttttttattcgagttcaaaatgggttgtaaagcagcagagtcaactcacaacaacaac acatttggcccgggaactgctaacaaacgtacagtgcagtggtggttcaagaagttttgc aaaggagacgagagccttgaagatgaggagcattgtggccagccatcggaagttgacaat gaccattga >gi568815597r:90612315_90817195|GENSCAN_predicted_peptide_2|1004_aa MPPSVITQEKAHEEDPRNQEDGKCCRHLLQITDKKLCCIDERKYRRLSVSPSALSMLDIS SQIFYTKATNLMSGSGKALINIIYECCLIQLQRPRNQNKQIFEGFSGTKRQRPREKGSKL DRKPTVLEAQSSAFKWPQAQGEGAVQKILCFCSLWISRACRGTVLAVPRFRGLLAPRLAC HLLPEAQLGAPRWRESAPCLGRLTQAAKEQEGEPLEEAQAAMRRAGRKSIFIGLAFLLTE LFANDIYLTVFDSSYEHISGKQDLYSLRLKLPRPESPGIASPPLPPSVPAPSRRPPLPPP PRVAATAAAVSAPQLSKKVQKLGYGAAEWRAAATSGAAGEFRGRAAARASRSLHLPRPPP PPLPPGARLWADPGSGGGGGFGVASRRLRLRLRLRLRLRLRLRLAMTMEGASGSSFGIDT ILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGTAPSSPISVTMEPPEP HLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQPPPPPPQQLGSAASAP RTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHESFRPKLEQEDSKTKL DKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFSDHQLNQLERSFERQK YLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAEAGNYSALQRMFPSPY FYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRVLIHGLGPGGQPALNP LSSPIPGTPHPRNRAGNLPHSSVTGENADPTQSATRVFLGQRGSEALGSDDNAGRGWAAG GRSSSLAAVLLPAAERRRGRTASRSALGEPPPRPPQAEQKRSRGSSPAGRAGPRPCSPPR LSARALSLQIWVSPSSCYYLVWDSESPLRELRRKKRTFARARLNLIYDKRALSLCGADGF PPPPTPIPAVRLHSLVKRPSVWLSSRLLLTIRIHVAARDLARRP >gi568815597r:90612315_90817195|GENSCAN_predicted_CDS_2|3015_bp atgcctccttctgtcatcacacaggaaaaagcacatgaagaggatcccaggaatcaggaa gatgggaaatgctgcagacacttattgcaaattactgacaaaaagctctgctgtattgat gagaggaagtacaggcggctttcggtgtccccgtcagctttaagcatgttagacatttct tcccagattttttacacgaaggccactaatttgatgtcaggctccggcaaagctctcatc aatattatctacgagtgctgcctaatacagctccaacgcccgaggaatcaaaataagcaa atctttgaaggtttttcaggcaccaaaaggcagagaccgagagagaaaggctccaaactt gaccgaaaacctactgtcttggaggcacagagctctgccttcaaatggccacaggcccag ggagagggagcggttcagaaaatactgtgcttctgctccctgtggatttccagggcctgc cgagggactgtgctggccgttccgcgtttccgcggcctccttgccccacggctggcctgt catcttctgcccgaggctcagctcggggctccaaggtggcgggagtcagcaccatgcctg ggacgtctgacccaagcagcaaaggagcaggaaggagaacctctggaggaagcgcaggca gccatgcggagggcggggaggaaaagcatatttattggattggcatttctcttaacagag ctcttcgctaatgatatttacttaactgtatttgacagcagttacgaacacatctccggt aaacaggatctatactccttgcgcctaaaactgcctcggccggagtcgcccggcatcgcg tctccgccgcttcctccttctgtgcctgctccttctcgtcgtcctcctcttcctcctcct cctcgggttgccgcaaccgctgcggccgtgtctgccccgcaactttccaagaaagttcag aaactcggctacggggctgccgagtggcgcgcggctgccacctcgggagccgcgggggag ttccgagggcgcgcggccgcccgggcttcgcggtcgctgcacctgccgcgtccgccgccg ccaccgctgccgcccggtgccaggctgtgggcagaccccggcagcggcggcggcggcggc tttggcgtagcttcccgccgcctgcgccttcgccttcgcctgcgccttcgccttcgcctt cgccttcgcctggcaatgacaatggaaggggccagcgggtcgagttttggaatagacacg attttgtccagtgccagttcaggcagcccaggcatgatgaatggagatttccgcccgctc ggtgaggccaggaccgcggattttaggagtcaggccaccccatctccctgttcggagatt gataccgtagggacggcgccttcttctcctatctcagtcaccatggagcccccggagccg catctggtagcagacgcgacccagcatcatcaccacctccaccacagccagcagccgccg ccgccggccgcggccccgacgcaaagtttgcagcctttgccccaacagcagcagccgctg ccgccacagcagccgccgccgccgcccccccagcagctgggctcggccgcctcggccccc aggacttccacgtcttcttttttaattaaggacatcttgggcgacagcaaacctctggcg gcatgtgcaccctacagcaccagcgtatcctctccccaccacaccccgaagcaggagagc aacgcagtgcacgagagcttcaggccaaagctcgagcaggaggacagcaagaccaaactc gacaagcgggaggattcccagagcgacatcaaatgccacgggacaaaggaggaaggagac cgggagattacgagtagccgtgagagtccccctgtgagagccaagaagcctcgaaaagca aggacagctttttccgaccaccagctcaatcaactggagcgtagctttgagcggcagaag tacctgagcgtgcaggatcgcatggacctggctgcagcgctcaacctcactgacacccaa gtcaagacctggtaccagaaccgcaggaccaagtggaagcggcagacagcggtgggcctg gagttgctggccgaggcagggaactactcggcgctgcagaggatgtttccatcgccttat ttctatcacccaagcctgctgggcagcatggacagcactacggcggcggcggctgccgct gccatgtacagcagcatgtaccggactcctccagcaccccatccccagctgcagcggccc ctggtgccccgtgtgctcatccacggcctagggcctgggggacagccagcccttaatcca ttgtccagccccatcccaggcaccccacacccccgaaacagggctggaaatctcccccac agcagtgtgactggtgaaaatgctgaccccacacagagtgcaaccagggtcttcctgggg cagcgagggtccgaagcgctgggcagcgatgacaacgccggcaggggctgggcggcaggc ggacgctcctcgagcctcgcagctgttctgctgcccgcggctgagcgccggcgaggtcgg acagcctctcggagcgcgctcggcgagccgccgcctcggccaccgcaggcagagcagaag cgctcccgaggcagcagccctgcaggtcgcgctgggcccaggccctgctccccaccgcgg ctctcagcccgcgccctctccctgcagatttgggtctctcccagctcttgctactatctg gtctgggacagcgagtcgcccctgcgggagctccgccgaaaaaagaggaccttcgcgaga gcgagattgaacttgatttacgataaaagggccctgtcactttgtggagctgatggattc ccgcccccacccacccccattcccgccgtcagacttcattctttagtgaagaggccctca gtgtggctgtcctcccgcctcctcctaacgattcgcatacacgtcgcagccagggatttg gcgcggcggccttga >gi568815597r:90612315_90817195|GENSCAN_predicted_peptide_3|83_aa MKIGLQLAHRAPLQEQEKGAPFELTLPGFDVYGLMSRTRVGFQCLLTLRVSTFDHNGGTS QKPLRFQMLFRGQRGRDVPFSGS >gi568815597r:90612315_90817195|GENSCAN_predicted_CDS_3|252_bp atgaagatagggctgcaactggcccacagagcacctttgcaagaacaagagaaaggtgca ccgtttgagctgactctgcctgggttcgatgtatacggcttgatgagcagaacaagagtt ggattccagtgcctacttacccttcgagtgagcacctttgaccacaatggtggcacaagc caaaaacccttgcggtttcagatgctgtttagaggacaacggggaagagatgtgcctttt tctgggtcttga >gi568815597r:90612315_90817195|GENSCAN_predicted_peptide_4|124_aa MPDYKWKRRPNRTSSKEETTLGCKALKYQLIFLFCENASRDCKEKTFFASVDLETKGPKD RYCSIEHFLMNLLCTNLRVFFLGNPDLQQTGKELQRSKGTIGVTANRNLNCSKSSSSGGL LSDV >gi568815597r:90612315_90817195|GENSCAN_predicted_CDS_4|375_bp atgccagattataaatggaaaaggaggcccaatagaactagcagtaaagaagagactacg cttggttgtaaagctttgaaatatcaactcatatttctattttgtgaaaatgcatccaga gactgcaaagagaaaaccttctttgcctctgtggacttggaaaccaaaggacccaaggac agatattgttccatagagcacttcctgatgaatctcttatgcacaaatctcagagtcttt ttcctggggaaccctgacctacaacagacagggaaagaattacagaggagcaaagggaca attggtgtcacagcaaacagaaacctcaactgttccaagtcctcatcttcaggaggcctt ctctctgatgtatga >gi568815597r:90612315_90817195|GENSCAN_predicted_peptide_5|89_aa MLGWRSSLLLRVTKKQGLAKSRGRPKAQSAPTKLDPGKSDTGATTRVTRQPLARAFYSET AGIKEAILVNWSIAQYLPMDRRVTGFEAM >gi568815597r:90612315_90817195|GENSCAN_predicted_CDS_5|270_bp atgctggggtggagaagctcattactgctcagagtaaccaagaaacagggcctggcaaag agtagagggagaccaaaggcccagagtgctcccaccaaattggatcctggaaagtctgac accggagcaactactagagtcactcggcagccgcttgctcgggctttttacagtgaaact gcagggataaaggaggccattctggtgaactggagcatcgcccagtatttgccaatggat agaagggtcacagggtttgaggccatgtag >gi568815597r:90612315_90817195|GENSCAN_predicted_peptide_6|94_aa MPVWFDLLPLALALAECWLGERPWMKDVLAAGTLTLTLEPGGSLTRSDGFASGSFSCAHT SLSCCFVKKVSASPSAMTPCEDVPASPSIMTEVS >gi568815597r:90612315_90817195|GENSCAN_predicted_CDS_6|285_bp atgccagtctggtttgacttgcttcctctggccctggcgctggctgagtgctggctaggg gagaggccctggatgaaagatgtcctggcagcaggaacattaacactcactcttgagcca ggagggagtctcacgagatctgatggatttgcaagtggcagtttttcctgtgctcacact tctctctcctgctgctttgtgaagaaggtgtctgcttccccttctgccatgactccgtgt gaagatgtgcctgcttcgccttccatcatgactgaagtttcctga