GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:35:13 Sequence gi568815590r:6997752_7116850 : 119099 bp : 45.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5040 5219 180 2 0 90 91 69 0.269 6.58 1.02 Intr + 8662 8813 152 0 2 13 82 99 0.194 0.76 1.03 Term + 9521 9698 178 2 1 60 42 114 0.175 1.16 1.04 PlyA + 10493 10498 6 1.05 2.00 Prom + 10825 10864 40 -5.86 2.01 Init + 12176 12318 143 1 2 106 75 99 0.543 8.03 2.02 Term + 16186 16213 28 1 1 119 38 27 0.463 -1.65 2.03 PlyA + 16737 16742 6 1.05 3.06 PlyA - 16804 16799 6 1.05 3.05 Term - 18348 18239 110 1 2 106 49 65 0.973 3.07 3.04 Intr - 19111 18925 187 1 1 135 94 121 0.516 16.66 3.03 Intr - 19665 19582 84 0 0 69 115 22 0.452 3.02 3.02 Intr - 22982 22902 81 2 0 74 60 95 0.055 5.13 3.01 Init - 31079 31059 21 2 0 108 100 16 0.539 4.59 3.00 Prom - 31815 31776 40 -4.66 4.04 PlyA - 32371 32366 6 1.05 4.03 Term - 40932 40820 113 1 2 99 43 77 0.835 3.02 4.02 Intr - 41700 41517 184 0 1 135 94 55 0.899 10.16 4.01 Init - 45185 45108 78 2 0 77 62 93 0.952 4.79 4.00 Prom - 51858 51819 40 -4.66 5.03 PlyA - 51990 51985 6 1.05 5.02 Term - 57792 57680 113 1 2 143 49 65 0.807 6.82 5.01 Init - 58946 58775 172 2 1 91 91 241 0.433 22.30 5.00 Prom - 65154 65115 40 -4.46 6.00 Prom + 65430 65469 40 -9.16 6.01 Init + 66626 66826 201 1 0 103 9 138 0.597 6.28 6.02 Intr + 72008 72119 112 1 1 29 69 36 0.021 -4.25 6.03 Intr + 74335 74508 174 2 0 61 116 76 0.578 7.61 6.04 Intr + 76257 76352 96 1 0 118 75 21 0.768 3.78 6.05 Intr + 80303 80371 69 0 0 57 77 59 0.541 0.85 6.06 Term + 83454 83533 80 1 2 57 55 143 0.882 5.83 6.07 PlyA + 84426 84431 6 1.05 7.00 Prom + 84756 84795 40 -9.36 7.01 Init + 85220 85328 109 1 1 86 58 59 0.941 3.08 7.02 Term + 85357 85607 251 2 2 -26 55 373 0.768 18.47 7.03 PlyA + 85730 85735 6 1.05 8.00 Prom + 89560 89599 40 -7.56 8.01 Sngl + 94351 94548 198 0 0 109 42 219 0.843 14.63 8.02 PlyA + 100769 100774 6 1.05 9.02 PlyA - 100802 100797 6 1.05 9.01 Term - 105829 105590 240 1 0 39 49 180 0.186 5.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 27313 27441 129 0 0 117 45 66 0.836 3.38 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_1|169_aa MGRSGGYAEVSSEEDPPPPSSSECIALSLPSALCPLSCREGPGAACATQSLPGLQIQLGQ LSGSRSGGVYHERRRTQSDVITNGKFLFRKCTSSLSLSLRTQGNHHKWIRRLLLGCSFIS SCLKWLCLSLQGHQEKHGSEEGKGPHGWGDLEEGEQEQTSRERGVSLDI >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_1|510_bp atggggcgcagcgggggctacgcagaggtgagctctgaggaggaccctccacctcccagc tcctcagagtgcatcgcgctgtccctcccctcagccctttgtccactgtcctgcagggag ggtcctggtgctgcctgtgccacacagtcactcccgggcctgcagatccagctgggacag ctcagcggcagcagatccggtggagtgtaccacgaaaggcgcaggacccaaagcgacgtt ataacaaacggcaaattcctcttccgcaaatgcacctcaagcctctcgctgagtctgagg acacagggaaatcatcataaatggatcagaaggctgctcctgggctgcagcttcatcagc tcttgcctgaagtggctctgcctgagcctacagggccaccaggagaagcatggcagtgag gagggcaaaggtcctcatggctggggtgacctggaggagggagagcaggagcagacgagt cgggagagaggagttagcctggatatatag >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_2|56_aa MGMSCTALVPQAGCSGPPLQLCVKAAGIGRWRPGGSGVITKASPSTLRVCSSHRPP >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_2|171_bp atgggaatgtcgtgcacagcgctggtcccacaggctgggtgctcagggccccccctgcag ctctgtgtgaaggcagcaggcatcggccggtggaggcctgggggcagtggagtgataact aaagcatctccctcaaccttaagagtttgcagcagtcatcgcccaccctag >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_3|160_aa MMNAISPGPTNAITQKPFTSIGATAGYSLLLDQQPDKIIVLSTLIPTGDYSPHNLKNLFM RMVTPAMRTLAILAAILLVALQAQAEPLQARADEVAAAPEQIAADIPEVVVSLAWDESLA PKHPGSRKNMDCYCRIPACIAGERRYGTCIYQGRLWAFCC >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_3|483_bp atgatgaatgcaatcagcccagggcccacaaatgccatcacccagaagcccttcacatcc attggggccacagcaggatacagtctgttgctggaccagcagcctgataaaatcattgtc ctctccaccctgattcctacaggagactactcaccccataacctcaaaaacctcttcatg aggatggtgaccccagccatgaggaccctcgccatccttgctgccattctcctggtggcc ctgcaggcccaggctgagccactccaggcaagagctgatgaggttgctgcagccccggag cagattgcagcggacatcccagaagtggttgtttcccttgcatgggacgaaagcttggct ccaaagcatccaggctcaaggaaaaacatggactgctattgcagaataccagcgtgcatt gcaggagaacgtcgctatggaacctgcatctaccagggaagactctgggcattctgctgc tga >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_4|124_aa MVLAHIQYLDPCLAALLAHALLRTLQVTTVRRTLTLLSAFLLVALQAWAEPLPARAHEMP AQKQPPADDQDVVLYFSGDDSCSLQVPGSTKGLICHCRVLYCIFGEHLGGTCFILGERYP ICCY >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_4|375_bp atggtgctggctcacatccagtacctggacccctgcttagctgctctcttagctcatgct ttgttgcgtaccctgcaggtgactacagttaggaggaccctcaccctcctctctgccttt ctcctggtggcccttcaggcctgggcagagccgctcccggcaagagctcatgagatgcca gcccagaagcagcctccagcagatgaccaggatgtggtcctttacttttcaggagatgac agctgctctcttcaggttccaggctcaacaaagggcttgatctgccattgcagagtacta tactgcatttttggagaacatcttggtgggacctgcttcatccttggtgaacgctaccca atctgctgctactaa >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_5|94_aa MRTIAILAAILLVALQAQAESLQERADEATTQKQSGEDNQDLAISFAGNGLSALRTSGSQ ARATCYCRTGRCATRESLSGVCEISGRLYRLCCR >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_5|285_bp atgaggaccatcgccatccttgctgccattctcctggtggccctgcaggcccaggctgag tcactccaggaaagagctgatgaggctacaacccagaagcagtctggggaagacaaccag gaccttgctatctcctttgcaggaaatggactctctgctcttagaacctcaggttctcag gcaagagccacctgctattgccgaaccggccgttgtgctacccgtgagtccctctccggg gtgtgtgaaatcagtggccgcctctacagactctgctgtcgctga >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_6|243_aa MQVCNCRCLSPGDLQEDGGAYWAQCSVWGYGKSDQSSPEEDCVLIEQVLIEQNKEIITGH LSAIQDLRRRHEKKTAETGWACRDEYITMIRIANQMFVNRDAVSHWELYMDGSSFINPQG ERGAGYAVVTLDTVVETRSLPQGTSAQKAELIAFIRDLELHEDIRKNVTGDVNSPAILGG VSSFPPLRIRNNITGKRKSNQECDTTHQDGYDKQTGIEQQKDDEGDDEDKEDDDRHHGIM NPY >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_6|732_bp atgcaggtctgcaactgcaggtgcctctcacctggtgacctgcaggaagatggaggagca tattgggcacagtgctcggtttgggggtacgggaaatcagatcagagcagcccagaggag gactgtgttctaatagagcaggttctaatagagcagaataaagagatcataactggccat ctgtcggccattcaagatctgcgcagaagacatgagaagaaaactgcagaaacaggctgg gcttgcagggatgaatacatcacaatgattagaatagctaaccagatgtttgtaaacagg gatgcagtaagccactgggaactatacatggatgggagtagcttcatcaacccacaagga gagagaggtgcagggtatgcagtggtaaccctggacactgttgttgaaaccagatcattg ccccagggcacttcagcccagaaagctgaactcattgctttcattcgggacttagaactc catgaagatattaggaaaaatgtaactggggatgtgaacagccctgcgatattgggagga gtatcatcctttccccccttgcgtattaggaacaatatcacagggaagcgcaaatcaaac caagaatgtgataccacacatcaggatggatatgataaacaaacaggcattgaacaacaa aaagatgatgaaggtgatgatgaggataaagaggatgatgacagacaccatggcatcatg aacccttactga >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_7|119_aa MAAAAATGTSPGSGPGDSPEGPEGEAHGASVEGAQNGEAAGLPAGPDLLDRTDLNRAHFD PEVYLDKLHRVCPLAQLMDSETDMVRQIRAIDSDMQTLVYENYDKFTPATEIDKQHKTV >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_7|360_bp atggcggcggcagctgccactggcactagcccggggtctggacctggggactccccagaa gggcctgagggggaggctcacggagcgtcggtggaaggcgcacagaatggggaggcagcg ggactccccgcggggcccgacctcctggaccgcactgatctgaacagggcgcacttcgac ccggaagtttacctagacaagctgcatagagtgtgccctctggcccagctgatggacagt gagacggacatggtgcggcagatccgggctatagacagcgacatgcagaccctggtctat gagaactacgacaagttcaccccagccacggaaattgacaaacagcataaaactgtatga >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_8|65_aa MASPSGSSEATGKPRGRDGRPRREEDDVPPEEKRLRLLLEGGSAQPEDCEDGEDALRPGR EETGT >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_8|198_bp atggccagcccttccggcagctccgaagccactggcaagccccgaggcagggatggccgg cccaggagggaggaggacgacgtccctcccgaagagaagaggctacggctgttgctggag gggggaagcgcacagcccgaggactgcgaggacggggaggacgcgctgcggccgggcagg gaggagaccggcacctag >gi568815590r:6997752_7116850|GENSCAN_predicted_peptide_9|79_aa VLEVALGSALKTMPLVKPQSSNDSQPRISNSPAPHVLTAHKAGAVNVLSTVISLNPHGSS RMQTVASHYSHFTFEETEA >gi568815590r:6997752_7116850|GENSCAN_predicted_CDS_9|240_bp gtactagaggtggctctgggctcggcactaaagacaatgcccctggtaaagccacagtct agcaatgacagtcaaccacgtatcagcaacagccctgccccacacgtgctgactgcgcac aaggccggcgctgtgaacgtgctctcaacagtgatctcactgaaccctcatggcagctct aggatgcagacagtagcatcacattattcccattttacttttgaggaaactgaggcctga