GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:56:23 Sequence gi568815596r:231632955_231881114 : 248160 bp : 48.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 20474 20513 40 -2.86 1.01 Init + 22739 22924 186 1 0 108 25 87 0.106 1.53 1.02 Intr + 29783 29969 187 1 1 88 57 84 0.223 4.56 1.03 Term + 32676 32905 230 1 2 49 43 159 0.157 4.29 1.04 PlyA + 32965 32970 6 1.05 2.05 PlyA - 34802 34797 6 1.05 2.04 Term - 43092 43052 41 1 2 94 48 52 0.186 -0.85 2.03 Intr - 53273 53138 136 2 1 58 90 85 0.493 5.84 2.02 Intr - 60010 59819 192 1 0 90 32 106 0.344 4.89 2.01 Init - 66925 66770 156 1 0 42 70 135 0.836 7.01 2.00 Prom - 67769 67730 40 -8.76 3.00 Prom + 75537 75576 40 -8.56 3.01 Init + 76573 76661 89 0 2 98 54 176 0.994 13.31 3.02 Intr + 76887 77110 224 0 2 92 57 80 0.619 3.07 3.03 Intr + 77231 77394 164 0 2 33 74 77 0.833 0.49 3.04 Intr + 78394 78465 72 0 0 61 68 105 0.696 5.40 3.05 Intr + 78933 79029 97 2 1 52 101 176 0.975 14.88 3.06 Intr + 79489 79562 74 2 2 77 90 137 0.999 11.93 3.07 Term + 79850 79897 48 1 0 65 42 156 0.985 6.00 3.08 PlyA + 80548 80553 6 1.05 4.07 PlyA - 80566 80561 6 1.05 4.06 Term - 81700 81647 54 1 0 65 43 79 0.176 -1.34 4.05 Intr - 84226 84104 123 1 0 2 61 110 0.112 0.48 4.04 Intr - 93509 93347 163 1 1 99 51 74 0.439 4.78 4.03 Intr - 104338 104233 106 2 1 129 92 97 0.733 13.47 4.02 Intr - 105184 105059 126 2 0 70 61 59 0.753 2.05 4.01 Init - 106227 106146 82 0 1 5 75 57 0.306 -2.77 4.00 Prom - 106959 106920 40 -5.46 5.00 Prom + 114477 114516 40 -5.06 5.01 Init + 115237 115356 120 0 0 53 57 118 0.714 5.39 5.02 Intr + 146390 146410 21 1 0 104 100 0 0.317 0.54 5.03 Term + 148348 148485 138 0 0 135 48 70 0.519 5.66 5.04 PlyA + 151373 151378 6 1.05 6.00 Prom + 155497 155536 40 -4.86 6.01 Init + 155617 155778 162 0 0 51 94 88 0.837 5.43 6.02 Intr + 158779 158854 76 0 1 108 95 45 0.857 6.29 6.03 Intr + 161309 161397 89 0 2 -37 102 122 0.817 0.79 6.04 Intr + 163023 163354 332 2 2 93 77 426 0.686 36.43 6.05 Intr + 165905 166010 106 2 1 82 94 169 0.947 17.12 6.06 Intr + 170065 170280 216 0 0 105 47 44 0.406 0.70 6.07 Term + 174533 174691 159 1 0 90 49 171 0.839 11.34 6.08 PlyA + 175947 175952 6 1.05 7.03 PlyA - 176757 176752 6 1.05 7.02 Term - 177893 177584 310 1 1 31 36 283 0.115 11.93 7.01 Init - 189503 189373 131 2 2 82 64 91 0.738 5.72 7.00 Prom - 233095 233056 40 -2.56 8.02 PlyA - 234678 234673 6 1.05 8.01 Term - 245148 244979 170 1 2 58 48 174 0.551 8.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_1|200_aa MMARGSVAPRLALVGWPTAIPCLGVSAHSWQDAAAHCVSAHGWQDAAVHWNDPERELSAF IKNPGACAFLYVHNLGSVRIPLLTLPSTHTKTLACTNPRVSMFSYVHDLEDVLGPENVRI CVCTGPRTWVMSFSVNGPLLQHKAPEAAPHPTPLLSRRPELPPLNATLGKGSVPGKHLVG NALASRINNGDLDDATVLSL >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_1|603_bp atgatggcacggggctctgtagccccaaggctagcacttgtgggctggcctacagcaatc ccctgcctgggcgtctctgcacatagctggcaggatgctgctgcccactgcgtctctgca catggctggcaggatgctgctgtccactggaatgatccagaacgagagctctcagctttc ataaagaaccctggggcgtgcgcctttctgtatgtgcacaacctcggaagtgtgcgcata cccttgcttacactccctagcacgcacaccaaaactctggcatgcacgaacccgcgggtt tccatgttttcgtatgtgcacgacctcgaggatgtacttggacccgagaatgtccgcatt tgtgtttgcacaggccccaggacttgggtgatgtccttctccgtcaacggtcctctcctt cagcacaaggctccagaggcagctccccatcccaccccactcctgagcaggcggccggag ctgccccctctgaatgccacgctgggcaaggggtccgtgcctgggaaacacctggtgggc aatgctctggcctcaagaataaacaacggggatctagatgatgccaccgtcctcagtctc tga >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_2|174_aa MQHHPEVKPHQGGEGKVGAFPGKLESHAHVKSEGKAASQHLDRELPKLSSASQARSTLPR PVSRCACTCLYNHEHGSHVLNVATDVCMRVQMRACAHDTTNRSTTLSPPRGRGTKQGPVW QQLRLQAHPDSPFQDPAVPMLRQLRCSCAINASSVAWEGGPAAIAERLLCMLLG >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_2|525_bp atgcagcatcacccagaagtgaagcctcatcagggaggggaaggcaaggtgggcgccttt ccaggaaagctcgagtcccatgcacatgtgaagtccgaggggaaagctgccagtcagcac ctggaccgagagcttcccaagctgtcctcagcctcacaggcccgttccacactcccacgc ccagtctctcggtgtgcatgtacatgtctgtacaaccatgaacacggatcccatgtcctg aatgtggccacagatgtgtgcatgcgtgtgcagatgcgagcatgtgcccatgataccacc aaccgttcaaccaccctttcccctcctaggggccggggcaccaagcaggggcctgtttgg cagcagctcaggctgcaggctcatcccgactcccccttccaagacccagcagtgcccatg ctgcgccagctgcggtgctcttgtgccatcaatgcctcctctgtggcctgggagggtggg ccggctgcaattgcggagcgcctgctgtgcatgcttctggggtaa >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_3|255_aa MGAPRGCPGAPCAPQLGATRANTAAATGQWDWRESPWRLLNPCVPGVISDSPRGDFLAER SSGPGSPQGLSVTPSGLSPGMSLWGENTVPSAGPGCSIFSEAPKDIRACRGGGSGASSRE PGRPTRDPRACHCKLCLPAGSLQGKGRTRRPRATCSLRADLKEKKEVVEEAENGRDAPAN GNAENEENGEQEADNEVDEEEEEGGEEEEEEEEGDGEEEDGDEDEEAESATGKRAAEDDE DDDVDTKKQKTDEDD >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_3|768_bp atgggcgccccccgcggctgcccgggagcaccgtgtgcgccgcagctcggggcgacgcgg gccaacacggcggccgcgacaggccaatgggattggcgcgagtcaccttggcgtctcctt aacccttgtgtccctggcgtcatctctgactctcccaggggcgacttcttggcagagcgg agctcggggcccggatctccacaggggctctcagtgaccccttctggactcagtccggga atgagtttgtggggtgagaacaccgtccccagtgcggggcctggctgttcgattttctcc gaagcaccaaaagacattcgggcctgccggggtggcggcagtggggcgtcgagtcgagag cccggccgaccgacgcgcgacccgcgcgcgtgccactgcaagctctgcctgccggccggg agtctccaaggcaagggacgcactcggcggccccgggccacgtgctccctgcgcgcggac ttaaaggagaagaaggaagttgtggaagaggcagaaaatggaagagacgcccctgctaac gggaatgctgagaatgaggaaaatggggagcaggaggctgacaatgaggtagacgaagaa gaggaagaaggtggggaggaagaggaggaggaagaagaaggtgatggtgaggaagaggat ggagatgaagatgaggaagctgagtcagctacgggcaagcgggcagctgaagatgatgag gatgacgatgtcgataccaagaagcagaagaccgacgaggatgactag >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_4|217_aa MNLRDAETGKILWQGTEDLSVPGVEHEARVPKKILKCKAVSRELNFSSTEQMEKFRLEQK VYFKGQCLEEWFFEFGFVIPNSTNTWQSLIEAAPESQMMPASVLTPVDPSPLTSLHKHLV RVLAIPTGPTSGPVTISGLQHGPPFLTYPPHHSQKEQAKKKKLDIASGHYCQEKGEGMAA GGKNNRCPLGPYGIHQQLLSLPGDMLNKRPEQPHESD >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_4|654_bp atgaaccttcgggatgctgagacagggaagatactctggcaaggaacagaagacctgtct gtccctggtgtggagcatgaagcccgtgttcccaagaaaatcctcaagtgcaaggcagtg tctcgagaacttaatttttcttcgacagaacaaatggaaaaattccgcctggaacaaaaa gtttacttcaaagggcaatgcctagaagaatggttcttcgagtttggctttgtgatccct aactccacaaatacctggcagtccttgatagaggcagcacccgagtcccagatgatgcca gcaagcgtcttaaccccagtagaccctagcccgttgacctcacttcataaacatcttgtt cgggtcctcgccatccccacgggccccacctcaggtcctgtaaccatctcaggactgcag cacgggcctcccttcctgacttaccctccacatcacagccagaaagaacaagctaagaaa aaaaagctggacattgcttcagggcattactgccaagaaaagggtgaaggaatggctgct ggtggcaaaaacaatagatgtcccctgggaccttatggaatccaccagcagctcctttca ctgcctggggacatgctgaacaagcggcctgagcagccccatgagagtgactaa >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_5|92_aa MWESLELPRDLLNGFAQNADSNMDNKIQMEMRNLFGTGAKVQELRLKPRSALLEPMESQP WRLASQPASRSAQRLRRGTVNVNGKGPPREVP >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_5|279_bp atgtgggaaagtttggaacttcctagagacttgttgaatggctttgcccaaaatgctgat agcaatatggacaataaaattcagatggagatgaggaacttgttcggcactggagcaaag gttcaagagctaagactgaagcctcgcagtgccctcctggaaccaatggaaagccagcct tggcgattggcatctcaaccagccagtaggagcgcgcagcggcttcgcagggggactgtt aacgtcaatgggaagggccctccccgggaagtcccatag >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_6|379_aa MAGEQKPSSNLLEQFILLAKGTSGSALTALISQVLEAPGVYVFGELLELANVQELAEGAN AAYLQLLNLFAYGTYPDYIANKESLPELSTAQQNKLKHLTIVSLASRMKLLIITTSLRFP SLEAAVGYGSNTCIFANACQMVFIMITWPYLQCIPYSVLLKDLEMRNLRELEDLIIEAVY TDIIQGKLDQRNQLLEVDFCIGRDIRKKDINNIVKTLHEWCDGCEAVLLGIEQQVLRANQ YKENHNRTQQQVEAEDVYILPQSGNDCRGLRFVSSLIGGVRCVHLKHKGDAVVWEAIEEG EALPGKYRLKGGRSVWAAAVRAVCMDQVTNIKKTLKATASSSAQEMEQQLAERECPPHAE QRQPTKKMSKVKGLVSSRH >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_6|1140_bp atggcaggggaacagaaaccctcaagtaatctcctggagcagtttattttactagccaaa ggtaccagtggctcagccctcactgctctcataagccaggtcttagaggctcccggagtg tatgtctttggagaacttctggagctggccaacgtgcaggagcttgcggaaggagctaat gctgcttatttgcagttgttgaacctgtttgcctatgggacatacccagattacatagcc aacaaggagagcctgccagaactgagcacagctcagcagaacaagctgaagcatcttacc atcgtgagcttggcatcaagaatgaagctattaataattaccactagcctgcgtttcccg agcctagaggcagctgttggctatgggagtaatacctgcattttcgctaatgcttgccag atggtttttataatgatcacatggccttatctacagtgtatcccctactccgtgttgctg aaagacctggagatgcggaatctccgggaactagaagaccttatcattgaggctgtctac actgacatcatccagggcaagctggaccagcgaaaccagctgctggaagtggatttctgc attggccgtgacatccgaaagaaggatatcaataatattgtcaagaccctgcatgaatgg tgtgatggctgtgaagcagttctactgggcatcgagcagcaagttctgagagccaaccag tacaaagagaaccacaaccgaactcagcagcaggtagaagcagaggatgtttacattttg cctcaaagtggtaacgactgtagaggacttagatttgtcagcagtcttattggaggagtg aggtgtgtgcatctgaaacacaaaggagatgcagtggtgtgggaggccatcgaagaggga gaggcactgccaggcaagtacagattgaagggtggcagatcagtgtgggctgcagcagtc agggcagtttgtatggaccaggttaccaacatcaagaagacactcaaagccaccgcatcc tcctcggctcaggagatggagcagcagctggctgaacgggagtgtccccctcacgctgag cagaggcagcccaccaagaagatgtccaaagtgaaaggtctggtctccagccgccactag >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_7|146_aa MRYVQGGMAEDYTPIKWLKSKTVITPNAGKMWRNWIIPTQLMEIKSAPTATPFAAFTTSK GGRVLEVGFGMATRIWKVQEVAIQEHWIMECNDSIFQLLQDWTQQQPHKVIPLKVLWKEV ALPCQVVALMGSCMTRTHCRRRPGTH >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_7|441_bp atgcggtatgttcagggtggtatggctgaagactacacacctatcaaatggctaaaatct aaaacagtgataacacccaatgctggcaagatgtggagaaactggatcattcctacacag ctgatggaaatcaagagcgcccctactgcaacccccttcgctgccttcaccacctccaaa gggggccgagtcttagaggtgggcttcggcatggccacaagaatatggaaggtgcaggag gtcgccatccaggaacactggatcatggagtgcaatgacagcatcttccagctgctccag gactggacccagcagcagccacacaaggtcatccccttgaaagtcctgtggaaggaggtg gcgctcccctgccaggtggtcgctttgatgggatcctgtatgacgcgtacccactgtaga aggagacctggcacacactag >gi568815596r:231632955_231881114|GENSCAN_predicted_peptide_8|56_aa XEENALEKAFAFTLVQSEKALSTSEWKTPTFRGLEQVSAGILRTGFVPVSGQLPKH >gi568815596r:231632955_231881114|GENSCAN_predicted_CDS_8|171_bp ncagaagaaaatgctctagagaaagcttttgccttcacgctggtacagagtgagaaagca ctgtcaacatctgagtggaaaaccccaacattccgcggcctcgagcaggtctctgctgga atcctgaggacaggctttgttcctgtgtctggccagctgcccaaacactga