GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:21:31

Sequence gi568815575r:69061310_69263073 : 201764 bp : 49.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -   2893   2888    6                               1.05
 1.08 Term -   2998   2896  103  0  1   24   53   127 0.013   0.55
 1.07 Intr -  20177  19869  309  1  0   60   63   206 0.065  10.73
 1.06 Intr -  22928  22825  104  2  2   61   80    29 0.054  -1.63
 1.05 Intr -  33703  33629   75  1  0   52   85    82 0.301   4.01
 1.04 Intr -  35144  35019  126  2  0   40   70    61 0.228   0.38
 1.03 Intr -  41748  41614  135  0  0  108   61     6 0.217   0.66
 1.02 Intr -  42895  42839   57  1  0  116   47    31 0.533   0.88
 1.01 Init -  44868  44737  132  0  0   72   87   111 0.760   7.56
 1.00 Prom -  59874  59835   40                              -5.16

 2.00 Prom +  61269  61308   40                              -7.46
 2.01 Init +  62568  62660   93  2  0   84   71    37 0.263   0.03
 2.02 Intr +  65883  65959   77  2  2  105   95    12 0.770   1.91
 2.03 Term +  67386  67896  511  0  1   90   55   215 0.466  12.25
 2.04 PlyA +  69258  69263    6                               1.05

 3.07 PlyA -  70366  70361    6                               1.05
 3.06 Term -  84590  84573   18  2  0   95   54    16 0.144  -2.78
 3.05 Intr -  88954  88862   93  1  0  102   89    23 0.478   3.96
 3.04 Intr -  89499  89474   26  1  2   76  105    18 0.425   0.04
 3.03 Intr - 101825 100069 1757  1  2  -12   14  1334 0.000 103.32
 3.02 Intr - 104190 104063  128  0  2  103   97   139 0.005  15.78
 3.01 Init - 118247 118209   39  2  0   65   99    31 0.212   2.09
 3.00 Prom - 118402 118363   40                              -4.06

 4.00 Prom + 124442 124481   40                              -3.76
 4.01 Init + 127646 127670   25  1  1   86   99    25 0.196   2.61
 4.02 Intr + 136246 136420  175  2  1   44   68    86 0.057   1.20
 4.03 Intr + 143023 143140  118  1  1   96   80    18 0.254   2.27
 4.04 Intr + 144984 145138  155  2  2   61   99    61 0.349   3.37
 4.05 Intr + 154123 154308  186  1  0   76   93    55 0.441   3.60
 4.06 Intr + 157945 158053  109  1  1  123    8    94 0.659   5.19
 4.07 Intr + 160212 160305   94  2  1  110   89    -5 0.510   1.34
 4.08 Intr + 175464 175601  138  1  0   96   58   112 0.643   9.44
 4.09 Term + 180101 180219  119  0  2   81   43    34 0.056  -3.10
 4.10 PlyA + 184146 184151    6                               1.05

 5.00 Prom + 184247 184286   40                              -4.96
 5.01 Sngl + 185638 186606  969  0  0   60   48   511 0.742  41.03
 5.02 PlyA + 187040 187045    6                               1.05

 6.00 Prom + 188535 188574   40                              -4.36
 6.01 Init + 188827 189118  292  0  1   71   50   115 0.680   3.31
 6.02 Term + 190127 190209   83  0  2  125   37    50 0.720   1.46
 6.03 PlyA + 190861 190866    6                               1.05

 7.02 PlyA - 195312 195307    6                               1.05
 7.01 Term - 198960 198813  148  2  1   81   42   101 0.367   2.17

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 101929  99998 1932  1  0   64   43  1311 0.954 118.12
S.002 Sngl + 104030 104239  210  1  0   53   41   234 0.875  10.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_1|346_aa
MGNFLPSLGFHFLIALWLAFLPEDWVRIPPGSACNRSDQSLESHGNSKRKGLQAYVPNYL
MGQEVEVTHAGPGVRGVWCIYRTVSHSADPIWSGTVRDEGQGEESGERAPEKPGGPGYCA
FSATWKEQLGLRTSGGLHPATGENETKKPQAHSVAFGKSYPVGDIHMFVVVALSLAYDCK
TVRQSVLQSWELAQREGNGTPRPNLEMISGQEPAPAFNELNGWEYGESYKGELWDAVRTY
TWGFFVLHERHRKGRSSRSQEASLGELQGQLGRAGAQTAAPAQTPGMGKAVVRAEVGRRL
PALDLDCSHSNGRTQLELEGAILEAESHPSPDTKSVGTLILDFSEL

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_1|1041_bp
atgggcaacttcctcccttctctgggcttccatttcctcattgccctatggttggccttc
ctccctgaggactgggttcggatcccaccagggtctgcttgcaacagaagtgatcagagc
ctagaatctcacggcaacagcaagaggaaaggcctgcaagcatatgtgcctaactaccta
atggggcaggaagtggaagtaacacatgcagggccaggggtgaggggcgtctggtgcatt
tatagaactgtgagtcattcagctgatccgatctggagtgggacagtcagggatgaaggc
caaggagaggagagtggggagagggcccctgaaaagcccggaggccctgggtactgtgcc
tttagtgccacgtggaaagaacagcttgggctcaggacttcaggtggtctccacccggcc
actggagagaatgagacaaaaaagccccaggcccactctgtggcctttggcaagtcctac
cctgttggggacatacacatgtttgtggtggtggccctgagtctggcctatgactgcaag
actgtgagacagtctgtcctccagagctgggagctggcacaacgtgagggcaatggcacc
ccaagacctaacttagagatgatatctggccaggagccagcccctgccttcaacgaactg
aatggctgggagtatggggagtcctacaaaggtgaactgtgggatgctgtcaggacttac
acgtggggattctttgtcctccacgaaaggcacaggaaaggccgatcaagcaggtcccag
gaggcttccctgggggagctgcagggacaactgggccgggctggcgcccagacagcagcg
cctgctcagactccggggatgggcaaggcagtggtgagggcggaggtggggaggcggctg
ccggccctggacctggactgctcccactccaatggaaggacacaactagaactagaaggt
gccatcttggaagcagagagtcacccctcaccagacaccaagtctgttggcaccttgatc
ctggacttttcagaactgtga

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_2|226_aa
MQKGSWRCCPNILPASLSRLDLALALAQPRQGTTIRTQALSRHLDYLMGSWLLTHTSSER
LLKGLCSTPELLAVEAAAVVAAAAGRRWRAREAPPQGQAARHFALIAWSGRQVRSPAAPP
ALHAFFPAKKERGELTGSRCRSPLGLDPKYPKYQPGHRETWGRAGGMLGTPGLIEGLDPT
LKPPPVVLQGPEGKIWSLGPGLSSLLSLCLVLVLSCEAPQLFPHGR

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_2|681_bp
atgcagaagggaagctggaggtgctgcccaaacatcctgcctgcgagcctgtcccggctc
gatttggctctggccttagctcaaccacgccagggcacaactatcaggactcaagccctt
tccaggcacttagactacctcatgggctcctggcttctcactcacactagcagcgagcgc
ctgctgaaaggcctctgctccaccccggagctactggccgtggaggctgcggcggtggtg
gcggcggcggcggggcgaagatggagagcccgcgaagctcctccccaggggcaggctgcc
cgccactttgctttgatagcctggtccgggaggcaagtccgcagtcccgcagctcctccg
gcgctgcacgcattcttcccggccaagaaggaacgcggcgagctaacgggatccagatgt
aggagtcctctgggcttggaccccaaataccctaaataccagccaggccaccgggagacc
tggggacgtgcaggaggcatgctggggactcccggcctcatcgagggcctggaccccacc
ctcaagccccctccagttgtgctccaggggcctgagggcaagatctggagcctgggccca
ggcctctcctctctcttgagtctgtgcttggtcctggtgcttagctgcgaggctccccag
ctctttccccatgggagatga

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_3|686_aa
MRNQAIRNQCERESPPAPLLSGGGGARDREQRLWRSWRSEIRELCFSEKRVAFWTWPPTS
QRERIASQRKTNSEVPMHRSAPSQTTKRSRSPFSTTRRSWDDSESSGTNLNIDNEDYSRY
PPREYRASGSRRGMAYGHIDSYGADDSEEEGAGPVERPPVRGKTGKFKDDKLYDPEKGAR
SLAGPPPHFSSFSRDVREERDKLDPVPAARCSASRADFLPQSSVASQSSSEGKLATKGDS
SERERREQNLPARPSRAPVSICGGGENTSKSAEEPVVRPKIRNLASPNCVKPKIFFDTDD
DDDMPHSTSRWRDTANDNEGHSDGLARRGRGESSSGYPEPKYPEDKREARSDQVKPEKVP
RRRRTMADPDFWTHSDDYYKYCDEDSDSDKEWIAALRRKYRSREQTLSSSGESWETLPGK
EEREPPQAKVSASTGTSPGPGASASAGAGAGASAGSNGSNYLEEVREPSLQEEQASLEEG
EIPWLQYHENDSSSEGDNDSGHELMQPGVFMLDGNNNLEDDSSVSEDLEVDWSLFDGFAD
GLGVAEAISYVDPQFLTYMALEERLAQAMETALAHLESLAVDVEVANPPASKESIDALPE
ILVTEDHGAVGQEMCCPICCSEYVKGEVATELPCHHYFHKPCPCRLLPAKVRGSRLFRRP
LPVPAGPATSSEPRPSAADQQVSRVK

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_3|2061_bp
atgcgtaatcaggcaattaggaaccagtgtgagcgggagtcgccgcccgccccgctgctg
tctggcgggggaggagctcgggaccgggagcagcggctctggcggtcgtggcggtcggag
atccgagagctctgcttctctgaaaagcgtgtggctttctggacttggccacccacgagc
cagcgggaaaggattgccagccagagaaagacgaactccgaagtcccaatgcacagatca
gcccccagtcaaaccaccaagaggagccgatcgccattttccactactcgtcgtagttgg
gacgacagcgagagttcgggaaccaacctgaatattgataatgaggactattccaggtat
ccgccaagagagtacagagcttcgggtagcagaagaggaatggcctatggacatattgac
tcttatggggcagatgatagtgaggaggagggggctgggcctgttgagcgaccgccagtg
agagggaaaactggcaagtttaaagatgataagctgtatgacccagagaaaggggcaagg
tctttggctgggccacctccacatttctctagttttagccgtgatgtgagagaggagcga
gacaagttagacccagtccctgcagcaagatgctcagctagcagagctgacttcctgcca
caaagtagtgtggcctcacagtcgtcttctgaaggcaagctggctacaaaaggtgacagc
tcggagagggagagaagggagcaaaatttacctgcacgtcccagcagggctcctgtgagt
atttgtggtggtggggaaaacacctcaaagagtgcagaggaacctgtggtcaggcccaaa
atcagaaacctggcaagtccaaactgcgtgaaaccaaaaattttttttgatactgatgat
gatgacgatatgccacacagtacttccaggtggagggataccgccaatgacaatgagggc
cactcggatggcctggcaagaagagggagaggcgagagttcaagtggctatcccgagcca
aagtaccctgaagacaaacgggaagcgaggagtgaccaagtgaaaccagaaaaggtgccg
agacgacgacgcaccatggccgaccctgacttctggacgcacagtgatgattactacaaa
tactgcgacgaagactctgacagtgacaaagagtggattgctgctctgcgtcggaaatat
cgaagccgagagcaaaccctgtcctccagtggcgaaagctgggagactctgccggggaaa
gaagagcgggaacctccacaggctaaggtgagtgccagcactggcaccagccctggcccc
ggtgctagtgccagtgccggggctggcgccggggccagtgctggcagcaatggcagcaat
taccttgaagaagttcgagaaccatctcttcaggaagagcaggcatccctggaagaagga
gaaattccttggctccagtaccatgagaatgacagtagcagtgagggggataatgattct
ggtcacgagttgatgcaacctggggtattcatgctggatggaaacaacaaccttgaagat
gactccagtgtgagcgaagacctagaagtggattggagcctctttgatggatttgcagat
gggttaggagtggctgaagccatttcctatgtggaccctcagttcctcacctacatggca
cttgaagaacgcctggcccaggcaatggaaactgcccttgcgcacttggagtctctcgca
gtggatgtagaggtggccaatccaccagcaagcaaggagagcattgacgctcttcccgag
atcctggtcactgaagatcatggcgcagttggtcaggagatgtgctgccccatctgctgt
agcgaatatgtgaagggggaggtggcaactgagctgccgtgccaccactatttccacaag
ccgtgcccctgccgcctgctccctgccaaggtcagagggtcaaggctcttccggcgcccc
ctccccgtgccggcagggcccgccacctcctcggagccccgcccctccgcggccgaccag
caggtctctcgcgtcaaatga

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_4|372_aa
MRARERAGGPNLLLEKSPPSTRAPMDQLVPLQLSWWKSHPTSGPLPSLLIACVKKKLLVP
KRFQENRILDSLSDPEGTLIMPKSLEIYKGSCNWEESGLLGSCFSQESSKVADSGPLNVT
GPKPKMLRSQPKDPFYALGAGQQASKKPLCSHWLLTMSHPPPSLETGFSPDYLAPTMPPL
GSPIIDPNPDSLLLKPGRVGGHTEAMRKLVLSRPLGTSYWTVAVPGLGQGVTRRILAKVR
KAILASQKQLLCLHLGVSRSKALPAPPPRAGRAPYLLGAPPLRGRPAGSPRRNLLQSFGE
HRMVGTSACLWMTKPLDLQELIAGVAAAPSPLPALLPVPTSHTIPLKKGKQETCLCYCHE
LWRIVSGKCFLT

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_4|1119_bp
atgagggccagagaacgagcaggagggccgaacctactgctggagaagtccccgccttcc
acccgagcacccatggatcagctggtgcccctacagctatcctggtggaagagccacccc
acttctggcccacttccttctcttttaatagcatgtgtgaagaagaagctgttggtgcca
aaaagattccaagaaaatagaatcctagactcactgtctgatccggaaggcactttaatc
atgcctaagagcctggagatttataaaggaagctgtaactgggaagaatccggcctgctg
ggaagctgcttctcccaggaatcctccaaggttgcagactcagggccactgaatgtcact
ggcccaaagcccaagatgctgaggtctcaaccaaaggacccattttatgcactgggggca
ggacagcaagccagcaagaagcccctgtgttctcactggcttcttaccatgagccaccca
cctccatccctggagactggcttctccccggattacctggcacccactatgccccctctt
ggcagtcccatcattgaccccaacccagactctctcctactcaagccaggaagagtggga
gggcacacagaggccatgaggaaactcgtgctgagccggcccttggggacctcatactgg
acagtggccgtcccgggcttgggccaaggggttactaggaggatcctggccaaagtccgg
aaagccatccttgcaagccagaagcagctgctctgcctgcatttgggggtcagcaggtcc
aaggctcttccggcgccccctccccgtgccggcagggccccctacctcctcggagccccg
cccctccgcggccgacccgcaggcagtcctcgaaggaacctcctccagtcctttggagaa
cacaggatggttggcaccagtgcctgcctgtggatgactaaaccattagacctgcaggaa
ctcattgctggtgtggcagcagccccttctccactgcctgccctactgccagtgcccacc
agccatacaattcctctgaagaaaggcaagcaggagacttgcctttgctactgccacgag
ctctggagaattgtatctggtaaatgtttcttaacctag

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_5|322_aa
MSKNQHKKAENSKNQNASSLPKDHNSSPAREQNWMENEFDKLTKVGFRKWVIANSSELTE
HVLTQCKEAKNLDKRLEELLTRISSLEKNINDRMELKNIARELHEAYTSINSQINQAEER
LSKIEDQLNEIKHEDRIREKRMKRNEQSFQEIWDYVKRPNLHLIGVPESDRENGNNLENT
FQDIIQENFPNLATQANIQIQETQRTPQRYSSRRGNPTAWKLNNLLLNDYWVNNEIKIEI
NKFFETNENKETMYQNLWDTAKAVFRGKFVALNAHKRQQERSKINTLTSQLKELEKQMQT
NSKASRRQEITNIRAELKDMGI

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_5|969_bp
atgagtaaaaaccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctctt
ccaaaggatcacaactcctcaccagcaagagaacaaaactggatggagaatgaatttgac
aaattgacaaaagtaggcttcagaaagtgggtaatagcaaactcctctgagctaacggag
catgttctaacccaatgcaaggaagctaagaaccttgataaaaggttggaggaattgcta
actagaataagcagtttagagaagaacataaatgaccggatggagctgaaaaacatagca
cgagaacttcatgaagcatatacaagcatcaacagccaaatcaatcaagcggaagaaagg
ttatcaaagattgaagatcaacttaatgaaataaagcatgaagacaggattagagaaaaa
agaatgaaaaggaacgaacaaagcttccaagaaatatgggattatgtgaaaagaccaaac
ctacatttgattggtgtacctgaaagtgacagggagaatggaaataacttggaaaacaca
tttcaggatattattcaggagaacttccccaacctagcaacacaggccaacattcaaatt
caggaaacacagagaacaccgcaaagatactcctcaagaagaggaaatccaactgcatgg
aaactgaacaacctgctcctgaatgactactgggtaaataatgaaattaagatagaaata
aataagttctttgaaaccaatgagaacaaagagacaatgtaccagaatctctgggacacg
gctaaagcagtgtttagaggaaaatttgtagcactaaatgcccacaagagacagcaggaa
agatctaaaatcaacaccctaacatcacaattaaaagaactagagaagcaaatgcaaaca
aattcaaaagctagcagaagacaagaaataactaatatcagagcagaactgaaggacatg
gggatatga

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_6|124_aa
MGKDFMTKTPKAIATKARIDKCDLIKLKSFCTAKETIFRMNRQPTEWEKIFAIYPSDKGL
ISSIYKELKQIYKKKTKQNNPIKKWAKDMNRHLQKKTSSLNGYICAAHSGGHKLHMGIKH
LKCD

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_6|375_bp
atgggcaaagacttcatgactaaaacaccaaaagcaattgcaacaaaagccagaattgac
aaatgtgatctaattaaactaaagagcttctgcacagcaaaagaaactatcttcagaatg
aacaggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggcta
atatccagcatctacaaggaacttaaacaaatttacaagaaaaaaacaaaacaaaacaac
cctatcaaaaagtgggcgaaggatatgaacagacaccttcaaaagaagacatcttctctg
aatggctacatttgtgctgcccattctggtgggcacaagctgcacatgggtattaagcac
ttgaaatgtgactag

>gi568815575r:69061310_69263073|GENSCAN_predicted_peptide_7|49_aa
XLCTPMIFVTSMQTETVGGQCGALDDQGMSNGSSLPGEEGFLQKLWGEG

>gi568815575r:69061310_69263073|GENSCAN_predicted_CDS_7|150_bp
nncctctgcacgccaatgatttttgtgacctccatgcagactgagactgttggaggacag
tgtggagccctggatgatcaaggcatgtcaaatggcagcagcctgccaggagaggaaggt
ttcctgcagaagctatggggagagggttag