GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:23:12 Sequence gi568815586f:121526743_121741640 : 214898 bp : 49.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 3608 3603 6 1.05 1.11 Term - 4318 4143 176 2 2 91 52 76 0.983 2.22 1.10 Intr - 6217 6064 154 1 1 98 94 198 0.999 21.05 1.09 Intr - 7848 7755 94 2 1 121 113 71 0.993 12.87 1.08 Intr - 22241 22135 107 2 2 69 69 178 0.112 13.01 1.07 Intr - 22896 22718 179 1 2 64 66 343 0.969 29.34 1.06 Intr - 23135 23078 58 2 1 79 98 47 0.723 3.26 1.05 Intr - 34482 34355 128 1 2 12 23 115 0.010 -2.20 1.04 Intr - 47851 47805 47 0 2 114 106 29 0.790 5.35 1.03 Intr - 49117 49039 79 2 1 32 97 62 0.760 0.11 1.02 Intr - 52204 52060 145 1 1 84 100 438 0.982 44.46 1.01 Init - 52677 52510 168 0 0 61 -4 198 0.680 5.44 1.00 Prom - 63092 63053 40 -5.76 2.00 Prom + 63724 63763 40 -3.06 2.01 Init + 66684 66805 122 2 2 39 109 101 0.661 6.99 2.02 Term + 89496 89664 169 0 1 85 48 111 0.206 4.15 2.03 PlyA + 90115 90120 6 1.05 3.00 Prom + 98878 98917 40 -4.56 3.01 Init + 99456 99682 227 2 2 69 48 162 0.439 6.54 3.02 Intr + 99796 99905 110 1 2 10 76 117 0.666 2.63 3.03 Intr + 99926 100308 383 0 2 5 89 463 0.970 32.73 3.04 Term + 114299 114901 603 1 0 133 47 1222 0.988 117.12 3.05 PlyA + 116348 116353 6 1.05 4.06 PlyA - 117376 117371 6 1.05 4.05 Term - 124902 124804 99 0 0 128 46 65 0.674 4.43 4.04 Intr - 126517 126333 185 2 2 112 64 292 0.997 28.71 4.03 Intr - 127691 127532 160 2 1 59 61 310 0.999 24.96 4.02 Intr - 132606 132449 158 1 2 66 97 100 0.311 8.43 4.01 Init - 142741 142597 145 1 1 103 113 159 0.963 20.18 4.00 Prom - 161955 161916 40 -3.66 5.00 Prom + 167524 167563 40 -5.16 5.01 Init + 177554 177557 4 1 1 89 92 0 0.306 0.66 5.02 Intr + 186038 186222 185 0 2 48 91 209 0.884 16.71 5.03 Term + 193486 193536 51 0 0 66 41 72 0.062 -2.27 5.04 PlyA + 195818 195823 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 22241 22131 111 2 0 69 47 184 0.873 10.96 S.002 Term - 160569 160376 194 1 2 48 45 156 0.860 4.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:121526743_121741640|GENSCAN_predicted_peptide_1|444_aa MPPAPGSGFPALPPRPLGLLLPPSRAPQRLLLGPTFAPPARVRSPLRGLHNQAPGRRPID RQRYDENEDLSDVEEIVSVRGFSLEEKLRSQLYQGDFVHAMEGKDFNYEYVQREALRVPL IFREKDGLGIKMPDPDFTVRDVKLLVAFKMTFSSLVLAPSPRLSELTDGTFKAGATGPWT NFRSWNLLDVAFEVIADWRGGHGGAGKQGSRRLVDVMDVNTQKGTEMSMSQFVRYYETPE AQRDKLYNVISLEFSHTKLEHLVKRPTVVDLVDWVDNMWPQHLKEKQTEATNAIAEMKYP KVKKYCLMSVKGCFTDFHIDFGGTSVWYHVFRGGKIFWLIPPTLHNLALYEEWVLSGKQS DIFLGDRVERCQRIELKQGYTFFIPSGVAGLALDSKDDWKALGLAADSVTLAGLFLSLHL SSLYNGGGVELNSGTLKFPPNPIL >gi568815586f:121526743_121741640|GENSCAN_predicted_CDS_1|1335_bp atgccgccggcccccggttccggcttcccggctctgccgccgcggccgctggggctgctg ttaccgccgagccgggcgccgcagcggctcctcctggggcctacatttgccccgccagcc cgtgtccggagcccccttcgcgggttacataaccaggctcccggccgacgcccgattgac cgccagcgatacgacgagaacgaggacttgtcggacgtggaggagatcgtcagcgtccgc ggcttcagcctggaggagaagcttcgcagccagctgtaccagggggacttcgtgcacgcc atggagggcaaagatttcaactatgagtacgtacagagagaagctctcagggttcccctg atatttcgagaaaaggatggactgggaattaagatgcctgaccctgatttcacagtccga gacgtcaaactcctagtggccttcaagatgactttcagcagcctcgtcctggccccgagc ccgaggctttcggagctgacggatggaaccttcaaagcaggggccacagggccatggact aatttccgcagctggaacctgttggatgtagcttttgaggtcatcgcagactggcgtgga ggccatggtggtgcagggaagcaagggagccggcggcttgtggacgtgatggatgtgaac acccagaagggcacggagatgagcatgtcccagtttgtgcgttactacgagacgcccgag gcccagcgggacaagctgtacaacgtcatcagcctagagttcagccacaccaagctggag cacttggtcaagcgtccgactgtggtagacctggtggactgggtggacaacatgtggccc cagcatctgaaggagaagcagacagaagccacgaacgccattgcagagatgaagtacccg aaagtgaaaaagtactgtctgatgagcgtgaaaggttgtttcaccgacttccacatcgac tttggaggcacttccgtttggtaccatgttttccggggtgggaagattttttggctgatt cctccaacgctgcacaatttggcgctgtacgaggagtgggtgctgtcaggcaaacagagt gacatctttctgggagaccgtgtggaacgatgccaaagaattgagctgaagcagggctac acatttttcatcccttccggtgttgctggattggccctggattcaaaggatgactggaag gctcttggccttgctgctgactctgtgacgttagctggtctcttcctctctctgcacctc agctctctctacaatggtggtggggtggagctgaactctgggaccctcaagttccctccc aaccccatcctctga >gi568815586f:121526743_121741640|GENSCAN_predicted_peptide_2|96_aa MAESVTIPLMRLRCSEMKTLALIPSLKRENSEMAPLQIESSVACFANRGCQTVQENDPAD THQVKGYSDSPRHMCIDKIYPNILYSKFSFEKGELD >gi568815586f:121526743_121741640|GENSCAN_predicted_CDS_2|291_bp atggcagagagtgtgacgatccccctgatgcggctgagatgttctgaaatgaagacgttg gctctcatccccagcctgaagagagaaaattctgagatggctcccttacagattgagagc agtgtggcctgctttgctaatagaggatgtcagaccgtccaggaaaatgatcctgctgac acccaccaagttaagggatatagtgattctccaaggcatatgtgtattgacaaaatttac cccaacattctgtacagcaaattcagttttgaaaaaggagaactagactag >gi568815586f:121526743_121741640|GENSCAN_predicted_peptide_3|440_aa MGLRRRPEVTSGGPTQESKLQEQGHGSSCGEGRRQEPERAAGRFQEKWRAAARQGPWAVP SERAGHVTPTTTPTSCLWSLPPGGFCQRRRGPACWGSGHFFDLVLLVLCGRPGGSRRLPP AEVPRGARAGPRLGGVLHASGARPAPEPQQSRASPKRRQHHQRQPPEPPPQRGRGAPGGP APPPPPSAVTYPDWIGQSYSEVMSLNEHSMQALSWRKLYLSRAKLKASSRTSALLSGFAM VAMVEVQLDADHDYPPGLLIAFSACTTVLVAVHLFALMISTCILPNIEAVSNVHNLNSVK ESPHERMHRHIELAWAFSTVIGTLLFLAEVVLLCWVKFLPLKKQPGQPRPTSKPPASGAA ANVSTSGITPGQAAAIASTTIMVPFGLIFIVFAVHFYRSLVSHKTDRQFQELNELAEFAR LQDQLDHRGDHPLTPGSHYA >gi568815586f:121526743_121741640|GENSCAN_predicted_CDS_3|1323_bp atgggcctcaggaggcgcccagaggtgacctcaggcggcccgacccaggagtccaagctc caggagcagggccacgggagcagctgcggagaggggcggcgccaggagccggagcgggca gccgggcgcttccaggaaaagtggcgggcggcggcgcgccagggaccgtgggcggtgccg tcggagcgggcgggtcacgtgacgcccacaacaacgcccacttcttgtctctggtcactg ccgcccgggggcttttgccagcggcgccgcgggcctgcgtgctggggcagcgggcacttc ttcgacctcgtcctcctcgtcctgtgcggccggccggggggcagtcggcggctgcctccg gcggaggtgcctcgcggcgcccgggccggcccgcgcctcggcggcgtgctccatgcatcc ggagcccgccccgcccccgagccgcagcagtcccgagcttcccccaagcggcggcagcac caccagcggcagccgccggagccgccgccgcagcggggacggggagcccccgggggcccc gccccgccaccgccgccgtccgccgtcacctacccggactggatcggccagagttactcc gaggtgatgagcctcaacgagcactccatgcaggcgctgtcctggcgcaagctctacttg agccgcgccaagcttaaagcctccagccggacctcggctctgctctccggcttcgccatg gtggcaatggtggaggtgcagctggacgctgaccacgactacccaccggggctgctcatc gccttcagtgcctgcaccacagtgctggtggctgtgcacctgtttgcgctcatgatcagc acctgcatcctgcccaacatcgaggcggtgagcaacgtgcacaatctcaactcggtcaag gagtccccccatgagcgcatgcaccgccacatcgagctggcctgggccttctccaccgtc atcggcacgctgctcttcctagctgaggtggtgctgctctgctgggtcaagttcttgccc ctcaagaagcagccaggccagccaaggcccaccagcaagccccccgccagtggcgcagca gccaacgtcagcaccagcggcatcaccccgggccaggcagctgccatcgcctcgaccacc atcatggtgcccttcggcctgatctttatcgtcttcgccgtccacttctaccgctcactg gttagccataagactgaccgacagttccaggagctcaacgagctggcggagtttgcccgc ttacaggaccagctggaccacagaggggaccaccccctgacgcccggcagccactatgcc tag >gi568815586f:121526743_121741640|GENSCAN_predicted_peptide_4|248_aa MPVSKCPKKSESLWKGWDRKAQRNGLRSQVYAVNGDYYVGEWKDNVKHGKGTQVWKKKGA IYEGDWKFGKRDGYGTLSLPDQQTGKCRRVYSGWWKGDKKSGYGIQFFGPKEYYEGDWCG SQRSGWGRMYYSNGDIYEGQWENDKPNGEGMLRLKNGNRYEGCWERGMKNGAGRFFHLDH GQLFEGFWVDNMAKCGTMIDFGRDEAPEPTQFPIPERTQTLQEKFKPVSPDRSDQCGSGW RSQQQLQA >gi568815586f:121526743_121741640|GENSCAN_predicted_CDS_4|747_bp atgccagtctctaagtgcccaaaaaagtcggagtccctgtggaaggggtgggaccggaag gcccagaggaacggcctgcggagccaggtatacgctgtgaatggcgactactatgtgggc gagtggaaggacaacgtgaaacacgggaaaggaacacaggtctggaagaagaaaggagcc atctatgagggggactggaagtttgggaagcgagacggctacggcaccctcagccttcct gaccaacagacaggaaagtgcaggagagtctactcaggctggtggaaaggtgataagaaa tcgggttatgggatccagtttttcggacccaaggagtattatgagggtgactggtgtggc agccagcgcagcgggtggggccgcatgtattacagcaacggcgacatctacgagggacag tgggagaacgacaagcccaacggggagggcatgctgcgcctgaagaacgggaaccgctac gagggctgctgggagagaggcatgaagaacggggcggggcgtttcttccatctggaccac ggccagctgtttgaaggcttctgggtggacaatatggccaaatgcgggacgatgatcgac tttggccgtgacgaggcccctgagcccactcagttccccattcctgagagaacacaaacg cttcaggagaaattcaagcctgtgtcacccgatcgctcagaccagtgcggctctggctgg aggagtcagcagcagctccaggcatga >gi568815586f:121526743_121741640|GENSCAN_predicted_peptide_5|79_aa MGTAGASEGGRRARGRGALGGRSGSAAGAAAGTAALHHRIMSGQLERCEREWHELEGEFQ ELQTPTYRSRNIPFMDSDT >gi568815586f:121526743_121741640|GENSCAN_predicted_CDS_5|240_bp atgggaacagctggtgcctccgagggcggtcggcgagcgcgcgggcgtggggcgctgggg ggccggtcgggcagcgctgcgggagcagccgccggcaccgccgccttgcaccatcgcatc atgtccgggcagctggagcgttgcgagcgcgaatggcacgagctggagggagaatttcaa gaactgcagacccccacttaccgctcccgtaacataccgttcatggattctgacacatga