GENSCAN 1.0	Date run: 6-Nov-116	Time: 04:15:55

Sequence gi568815592r:107944788_108174651 : 229864 bp : 43.55% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1151   1146    6                               1.05
 1.02 Term -   9284   9190   95  0  2   83   49    40 0.411  -2.41
 1.01 Init -  13222  13099  124  1  1   96  117   219 0.817  25.93
 1.00 Prom -  27651  27612   40                              -4.66

 2.06 PlyA -  27893  27888    6                               1.05
 2.05 Term -  60472  59570  903  1  0    4   38  1455 0.996 124.43
 2.04 Intr -  60752  60515  238  1  1   55   -8   280 0.297  12.72
 2.03 Intr -  61120  61070   51  0  0   80   80    43 0.288   0.72
 2.02 Intr -  65188  65164   25  2  1   77   98     3 0.041  -2.72
 2.01 Init -  71585  71450  136  2  1   53   57   103 0.271   4.00
 2.00 Prom -  76982  76943   40                              -0.56

 3.02 PlyA -  77212  77207    6                               1.05
 3.01 Sngl -  87565  87197  369  1  0   88   49   116 0.812   3.97
 3.00 Prom -  89580  89541   40                              -2.96

 4.00 Prom +  89694  89733   40                              -5.36
 4.01 Init + 102566 102652   87  1  0   92  106    60 0.928   8.84
 4.02 Intr + 110759 110812   54  1  0   88   49    53 0.240   0.58
 4.03 Term + 113042 113056   15  1  0  120   44    -1 0.266  -3.06
 4.04 PlyA + 113118 113123    6                               1.05

 5.03 PlyA - 115450 115445    6                               1.05
 5.02 Term - 118856 118845   12  2  0  134   32     4 0.284  -2.40
 5.01 Init - 129864 129463  402  0  0   79   47   641 0.775  53.73
 5.00 Prom - 141620 141581   40                              -2.66

 6.05 PlyA - 141761 141756    6                               1.05
 6.04 Term - 155510 155480   31  1  1  114   34    47 0.124  -0.77
 6.03 Intr - 161749 161568  182  1  2   91   13   111 0.081   2.57
 6.02 Intr - 173332 173201  132  1  0   53   50   103 0.237   3.84
 6.01 Init - 173855 173673  183  2  0   17   69   180 0.459   6.12
 6.00 Prom - 187287 187248   40                              -4.06

 7.00 Prom + 202433 202472   40                              -4.46
 7.01 Init + 211090 211126   37  0  1   65   93    37 0.052   2.10
 7.02 Intr + 221865 222003  139  1  1   83   39    75 0.077   1.62
 7.03 Intr + 223504 223614  111  1  0   50  100    91 0.732   5.99
 7.04 Intr + 224699 224829  131  2  2   79   70    71 0.874   4.74
 7.05 Intr + 225386 225472   87  0  0   64   43    71 0.492   0.14
 7.06 Intr + 226671 226816  146  1  2   97   82   190 0.757  19.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_1|72_aa
MAGQQFQYDDSGNTFFYFLTSFVGLIVIPATYYLWPRDQNADMATIRFPNPFPAFPLFLF
HKTAIVIMARSQ

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_1|219_bp
atggccgggcagcagttccagtacgatgacagtgggaacaccttcttctacttcctcacc
tccttcgtggggctcatcgtgatcccggcgacatactacctctggccccgagatcagaat
gccgacatggcaaccatccgatttcccaatcctttccctgcctttcccctctttctattc
cacaaaaccgccattgtcatcatggcccgttctcaatga

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_2|450_aa
MWECLELPRDLLSGFAQNADSDMDNKIQAKVVSDGDEELVGNWSKVREHLIHIRYFQSLM
VTYKIFIDVARHGSLGFLPQKRSSRHRGKVKSFPKDDPSKPVHLTAFLGYKAGMTHIMRE
VDSPASKVNKKQVVEAVTIVETPPMVVVGITVFAEHISNECKRRFYKNWHKSKKKAFTKY
CKKWQDEDVKKQLEKDFSSMKKYCHVIRVIAHTQMRLLPLRQKKAHLMEIQVNGGTVAEK
LDWVCERLEQQVNVNQVFGQDEMIDVIGVTKGKGYKGVTSRWHTKKLPCKTHRGLCKVAC
IGAWHPARVAFSVARAGEKGYRHRTEINKKIYKIGQGYLIKDGKLIKNNASTDSDLSDKS
TNPLGGFVHYGEVTNDLVMLKGCVVGTKKRVLTLHESLLVQMKRQALEKIDLKFIDTTSK
FGHGCFQTMEQKKAFMGPLKKDGIAKEEGA

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_2|1353_bp
atgtgggaatgtttggaacttcctagagacttgttgagtggctttgcccaaaatgctgat
agcgatatggacaataaaatccaggctaaagtggtctcagatggagatgaggaacttgtt
gggaactggagcaaagtcagagaacatcttatacacatcagatattttcagtccttaatg
gtcacctacaaaatattcattgatgtggccagacatgggtccctcggcttcctgcctcag
aagcgcagcagcaggcatcgtgggaaggtgaagagcttccctaaggatgatccgtccaag
ccggtccacctcacagccttcctgggatacaaagctggcatgacccacatcatgagggaa
gtcgacagcccagcatccaaggtgaacaagaagcaggtggtggaggctgtgaccattgtg
gagacgccacccatggtggttgtgggcattactgtcttcgctgagcacatcagtaatgaa
tgcaagaggcgtttctataagaactggcataaatctaaaaagaaggcctttacgaagtac
tgcaagaaatggcaggatgaggatgtcaagaagcagctggagaaggacttcagcagcatg
aagaaatactgccacgtcatccgcgtcattgcccacacccagatgcgcctgcttcctctg
cgccagaagaaggcccacctgatggagatccaggtgaacggaggcaccgtggctgagaag
ctggactgggtctgcgagaggctcgagcagcaggtaaatgtgaaccaagtgtttgggcag
gatgagatgatcgacgtcattggggtgaccaagggcaaaggctacaaaggggtcaccagt
cgttggcacaccaagaagctgccctgcaagacccaccgaggcctgtgcaaggtggcctgt
attggggcatggcatcctgctcgtgtggccttctctgtggcacgtgctggggagaaaggc
taccgtcaccgcactgagatcaacaagaagatctataagattggccagggctaccttatc
aaggatggcaagctgatcaagaacaatgcctccactgactctgacctgtctgacaagagc
accaatcctctgggtggctttgtccactatggtgaagtgaccaatgaccttgtcatgctg
aaaggctgtgtggtgggaaccaagaagcgggtgctcaccctccacgagtccttgctggtg
cagatgaaacggcaggctctggagaagattgaccttaagttcattgacaccacctccaag
tttggccatggctgcttccagaccatggagcagaagaaagcattcatgggaccactcaag
aaagacggaattgcaaaggaagaaggagcttaa

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_3|122_aa
MGTLISVMPWALHRACSRQCPKWPARSCTHSLTPDLAMSPTWSLLLPVTRTAGEIPHSLA
HGWFGHGPHMELASAGARSGRLDPALTRSQAPTHKGLNVVGRASVPSGWESSEGAKKNPA
SI

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_3|369_bp
atgggcaccctcatctcggtgatgccatgggccctgcacagagcttgctcccgccagtgc
ccaaagtggccagccagatcctgcactcactcgctcactcctgatctggccatgagcccc
acatggagcctgctcctgccagtgactagaacagctggcgagatcccacactcacttgct
catggctggtttggccatgggccccacatggagcttgcttctgctggtgcccggagcggc
cggctggatcctgcactcactcgctcacaggctcccacccacaaggggttgaatgtggtg
ggccgagcaagtgttccctccggttgggagtccagtgaaggggccaagaaaaatcctgca
tcaatctga

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_4|51_aa
MSNEENGCDEFRKVAMGQVMYSLESRPWKPAISNRGSTAWFQILELVELGA

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_4|156_bp
atgagcaatgaggagaatggatgtgacgagtttagaaaggtggctatgggccaggtcatg
tacagcttggagagcagaccctggaaacctgctatctccaatcgaggttcaactgcctgg
ttccagatcttggaacttgtggaattgggagcttaa

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_5|137_aa
MEPGPTAAQRRCSLPPWLPLGLLLWSGLALGALPFGSSPHRVFHDLLSEQQLLEVEDLSL
SLLQGGGLGPLSLPPDLPDLDPECRELLLDFANSSAELTGCLVRSARPVRLCQTCYPLFQ
QVVSKMDNISRAAGVCV

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_5|414_bp
atggagccgggcccgacagccgcgcagcggaggtgttcgttgccgccgtggctgccgctg
gggctgctgctgtggtcggggctggccctgggcgcgctccccttcggcagcagtccgcac
agggtcttccacgacctcctgtcggagcagcagttgctggaggtggaggacttgtccctg
tccctcctgcagggtggagggctggggcctctgtcgctgcccccggacctgccggatctg
gatcctgagtgccgggagctcctgctggacttcgccaacagcagcgcagagctgacaggg
tgtctggtgcgcagcgcccggcccgtgcgcctctgtcagacctgctaccccctcttccaa
caggtcgtcagcaagatggacaacatcagccgagccgcgggggtttgtgtgtaa

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_6|175_aa
MRKKRAPSRYLPLRAPQAAGPRQPGFSRRAGRPLRPLRPLPTAPAARRAHGLGGWVLARA
QRNFEGGNCSGAHGLGGGSPSLSLHRVWEGLKPKLLGLRRVRGRKPFVFEPNVKSLPLGS
HYYPSKPLGFSSSSPLEGRLPSRTFPCSYVNPCAYICYVSAGAFVMIPSKMPHYI

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_6|528_bp
atgcgaaagaaacgcgcccctagccggtacctcccgctccgggccccgcaggcggctggg
cctcgtcagcccgggttcagccgccgcgccggccgccccctgcgccccctgcgccccctg
cccacggcccccgcagcccggcgggcgcacggcctcggcggctgggtcctcgcgcgggcg
cagaggaacttcgagggcgggaactgctccggcgctcatggactcgggggcggcagcccg
agcctctccttgcaccgtgtctgggagggcctgaagcccaaactcctcgggctgagaagg
gtccggggccgcaagccctttgtctttgaaccaaatgtcaagtccctgcccctgggaagc
cattattaccccagcaagcccctgggcttctcaagttccagtcctctagagggccgcctc
ccctccaggacctttccttgctcctatgtaaatccctgtgcctacatctgctatgtgtcc
gctggggcctttgttatgatcccgtccaagatgccacattacatctga

>gi568815592r:107944788_108174651|GENSCAN_predicted_peptide_7|217_aa
MDMLSIATLANKGALQAGGQLESGGARRRGGRCRPGLGQRPPTAPPRDSQHEQASRINKP
PSSLAPGQALAYTPGSKEPRALSSFSGCGAPGALESATGKRTAGVTVLLGPKRGPWSGQR
ARVKQRMVVRYGNQVYRFQIPNPSAFGSSCRISGRPEAAGAQTGGLRIGRILDIPCKVCG
DRSSGKHYGVYACDGCSGFFKRSIRRNRTYVCKSGNQ

>gi568815592r:107944788_108174651|GENSCAN_predicted_CDS_7|651_bp
atggacatgctcagcattgccacacttgccaataaaggagccttgcaggctggagggcag
ctggagagcggcggcgcccggcggcgaggcgggcgctgccggccgggactcgggcagcgc
ccaccaaccgctccgccccgggacagccagcatgagcaagccagccggatcaacaagcct
cctagctcgctcgccccgggacaggccctcgcctacacccctggaagtaaggagccccgg
gctctttcgtccttttcggggtgtggagcccctggggcccttgaaagtgccacagggaag
aggacagctggggtgacagtactgctgggccccaaacgtggaccctggagcggtcagagg
gcgcgtgttaagcagaggatggttgtaagatatggaaaccaagtttaccgcttccagatc
cccaacccaagcgctttcggcagcagctgtcgaatttcagggaggcctgaggccgctggg
gcccaaactggaggccttcggataggccgcattttagatatcccctgcaaagtgtgtggc
gaccgcagctcggggaagcactacggggtctacgcctgcgacggctgctcaggttttttc
aaacggagcatccgaaggaataggacctatgtctgcaaatctggaaaccag