GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:20:43 Sequence gi568815577r:36360979_36561695 : 200717 bp : 46.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3178 3281 104 0 2 84 82 38 0.544 1.77 1.02 Intr + 8010 8898 889 0 1 42 105 541 0.972 42.52 1.03 Intr + 11396 11553 158 1 2 44 85 125 0.964 6.61 1.04 Term + 14165 14318 154 2 1 71 39 136 0.990 4.39 1.05 PlyA + 14983 14988 6 -0.45 2.00 Prom + 15333 15372 40 -4.26 2.01 Init + 24374 24473 100 1 1 41 96 196 0.719 14.12 2.02 Intr + 25184 25284 101 0 2 1 59 150 0.676 3.23 2.03 Intr + 26620 26752 133 0 1 38 101 82 0.995 4.72 2.04 Intr + 30573 30690 118 1 1 59 35 157 0.923 7.02 2.05 Intr + 33569 33672 104 2 2 76 94 69 0.887 6.12 2.06 Intr + 36437 36533 97 0 1 86 99 -7 0.492 -0.73 2.07 Intr + 38543 38627 85 2 1 49 60 76 0.451 0.72 2.08 Intr + 41780 41873 94 1 1 48 89 68 0.369 2.44 2.09 Intr + 50485 50626 142 2 1 89 101 170 0.999 17.81 2.10 Intr + 51852 52337 486 0 0 35 111 214 0.307 10.53 2.11 Intr + 54317 54411 95 2 2 39 111 34 0.730 0.41 2.12 Term + 55545 55675 131 1 2 68 47 83 0.489 0.54 2.13 PlyA + 58016 58021 6 1.05 3.02 PlyA - 60242 60237 6 -0.45 3.01 Sngl - 61789 61412 378 1 0 46 49 268 0.822 14.88 3.00 Prom - 65817 65778 40 -4.26 4.04 PlyA - 67083 67078 6 1.05 4.03 Term - 72242 72157 86 0 2 62 35 95 0.253 -0.58 4.02 Intr - 83463 83305 159 1 0 70 72 115 0.407 8.06 4.01 Init - 85065 85044 22 0 1 97 87 12 0.641 1.86 4.00 Prom - 97534 97495 40 -3.56 5.02 PlyA - 97541 97536 6 1.05 5.01 Sngl - 100717 99998 720 1 0 110 47 1170 0.998 109.33 5.00 Prom - 116748 116709 40 1.14 6.04 PlyA - 117129 117124 6 1.05 6.03 Term - 125920 125341 580 0 1 68 49 297 0.108 17.76 6.02 Intr - 132619 132514 106 2 1 122 18 42 0.001 -0.13 6.01 Init - 148277 148247 31 2 1 105 80 50 0.555 5.84 6.00 Prom - 160953 160914 40 -3.66 7.00 Prom + 161196 161235 40 -7.86 7.01 Init + 162927 163026 100 2 1 69 60 71 0.503 2.83 7.02 Term + 167880 168022 143 1 2 136 39 66 0.807 4.59 7.03 PlyA + 168856 168861 6 1.05 8.00 Prom + 170594 170633 40 -3.06 8.01 Init + 188391 188464 74 2 2 52 90 141 0.926 11.24 8.02 Intr + 190307 190383 77 2 2 -9 79 76 0.499 -3.84 8.03 Intr + 192671 192994 324 0 0 63 22 211 0.374 7.75 8.04 Intr + 193352 193460 109 0 1 74 96 50 0.503 3.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_1|434_aa ESVPRRHLSEGTNSYATRLLNNHQVPPQSEPESNSLKRRLSTRSSILNAKNRRLSSQFEN SVYKGDDDDEDVIILEENSTPKPAVDHDIDMKSEQSHVEQGGVQVEFVGDSEPCGQTGST STSSSRCDQGNTAATQTEVPSLVVKKEETVEDEIDVRNDAVILPSCVEAEAKIHETQETT DKSADDAGCQLQELRNQLLLVTEEKENYKRQCHMFTDQIKVLQQRILEMNDKYVKKETCH QSTETDAVFLLESINGKSESPDHMVSQYQQALEEIERLKKQCSALQHVKAECSQCSNNES KSEMDEMAVQLDDVFRQLDKCSIERDQYKSEVELLEMEKSQIRSQCEELKTEVEQLKSTN QQTATDVSTSSNIEESVNHMDGESLKLRSLRVNVGQLLAMIVPDLDLQQVNYDVDVVDEI LGQVVEQMSEISST >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_1|1305_bp gaaagtgttccaagaagacatctttcagaaggaacaaattcttatgcgacaagacttcta aataatcatcaagttccacctcagtctgaacctgagagcaacagcttgaaacggagactt tctactcgttcctcaattttgaatgcaaagaatcggagattgagtagtcagtttgaaaat tcagtttataaaggtgatgatgatgatgaagatgtcatcatcttagaagaaaacagtacc cccaaacctgcagtagatcatgatattgacatgaaatcagaacagagtcacgttgagcaa ggtggtgttcaggttgagtttgtgggtgacagtgaaccttgtggccagactggttcaaca agcacctcatcatcccgatgcgaccagggaaatactgcagctacccagactgaagtacca agtttagttgttaaaaaagaagaaactgttgaagacgagatagacgtaagaaatgatgca gtgattctgccctcctgtgtagaagctgaagcaaagatacatgaaacccaggaaaccacc gataaatctgcagatgatgcaggctgccaattacaagaactgagaaaccagctactcctt gtcactgaggaaaaagagaattataaaagacagtgtcatatgtttactgatcaaatcaaa gtgttacaacagaggatactagaaatgaatgacaagtatgttaagaaagaaacttgccat cagtccactgaaaccgatgctgtatttttacttgaaagtattaatggcaaatctgaaagt ccagaccatatggtatctcagtatcagcaagctttggaagaaatagaaaggctgaaaaaa caatgtagtgctttgcaacatgtaaaggctgaatgcagccagtgttccaataatgagagt aaaagtgaaatggatgagatggctgtgcagcttgacgatgtgtttagacaactggacaaa tgcagtattgagagggaccagtataaaagtgaggttgaattgctggaaatggaaaagtca caaatccgttcacagtgtgaagaactcaaaactgaagtagaacagttaaaatctacaaat caacagacggcaacagatgtttcaacatcaagtaacattgaggagtctgtaaatcatatg gatggagaaagcctcaaactccgatctcttcgagttaacgtaggacaactgctggctatg attgtgcctgatcttgatcttcagcaagtgaattacgatgttgatgtagttgatgagatt ttaggacaagttgttgaacaaatgagtgaaatcagtagtacttaa >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_2|561_aa MRSRPCGGGACGAREAARAAREVTVPLTVRVPPAWHNKEPVYSLDFQHGTAGRIHRLASA GVDTNVRIWKVEKGPDGKAIVEFLSNLARHTKAVNVVRFSPTGEILASGGDDAVILLWKV NDNKEPEQIAFQDEDEAQLNKENWTVVKTLRGHLEDVYDICWATDGNLMASASVDNTAII WDVSKGQKISIFNEHKSYVQGVTWDPLGQYVATLSCDRVLRVYSIQKKRVAFNVSKMLSG IGAEGEARSYRMFHDDSMKSFFRRLSFTPDGSLLLTPGVELMSLPYRLVFAVASEDSVLL YDTQQSFPFGYVSNIHYHTLSDISCRCESVGKVCNFSLFWGRRSSDGAFLAISSTDGYCS FVTFEKDELGIPLKEKPVLNMRTPDTAKKTKSQTHRGSSPGPRPVEGTPASRTQDPSSPG TTPPQARQAPAPTVIRDPPSITPAVKSPLPGPSEEKTLQPSSQNTKAHPSRRVTLNTLQA WSKTTPRRINLTPLKTDTPPSSVPTSVISTPSTEEIQSEIWMLLYSVSIFNLGHERFNVV NPLFDEFLKLERFNVIQCENQ >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_2|1686_bp atgaggtcccgcccatgcgggggcggggcctgcggcgcgcgggaagcggcgcgcgctgcg cgggaggtgacggtgcctctgactgtccgggtccctccagcctggcacaacaaggagccc gtgtacagcctggacttccagcatgggacggctgggaggatccacagactggcgtctgcc ggcgtggacaccaatgtcaggatctggaaggtagaaaagggaccagatggaaaagccatc gtggaatttttgtccaatcttgctcgtcataccaaagccgtcaatgttgtgcgtttttct ccaactggggaaattttagcatcgggaggagatgatgctgtcatcctattgtggaaggtg aatgataacaaggagccggagcagatcgcttttcaggatgaggacgaggcccagctgaac aaggagaactggacggttgtgaagactctgcggggccacttagaagatgtgtatgatatt tgctgggcaactgatgggaatttaatggcttctgcctctgtggataacacagccatcata tgggatgtcagcaaaggacaaaagatatcaatttttaatgaacataaaagttatgtccaa ggagtaacctgggaccctttgggtcaatatgttgctactctgagctgtgacagggtgctg cgagtatacagtatacagaagaagcgtgtggctttcaatgtttcgaagatgctgtctgga ataggggctgaaggagaggcaagaagctaccggatgtttcacgacgacagcatgaagtct ttcttccgtagactgagtttcactcccgacggatctttgcttctcacgccaggtgtggag ctgatgagtctgccctaccgcctggtgtttgctgtggcctcggaggattccgtgcttctg tatgacacccagcagtccttcccttttggttacgtgtctaatatacattaccacaccctc agtgacatttcatgtagatgtgagagtgttggtaaggtctgtaacttttccctgttttgg ggacgaaggtccagcgatggtgccttcctggccatttcttccacggacggttactgctca tttgtgacatttgagaaagatgaacttggaattcctttgaaagagaagccagttttgaac atgagaactcctgatacagcaaagaaaaccaagagtcagacacatcgagggtcttcgcca ggacccagaccggtagagggaacccctgccagcagaacccaagaccccagcagccccggc acgactccccctcaggccagacaggccccagccccaacagtcatcagggaccctccctcc atcactcctgctgtcaaaagccccttgccggggccttcggaggagaagaccctgcagccc agtagtcaaaacacaaaagcccacccatcccggagggtcactctgaacacactgcaagcc tggagcaagacaacaccccggagaataaacttaacacccttaaagacggacactccacca agttctgtaccaaccagtgtgatttccaccccttctacagaagaaattcagtcagaaata tggatgctgttgtattcagtatccatttttaacttgggacatgaacgttttaacgtagta aatcctctttttgatgagtttctgaaactggagcggttcaacgttatccagtgtgaaaat cagtga >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_3|125_aa MWRGSGPVTSRIRRQATSKAALNIEDSVEIKRTNNSNASGVNDNQQTCQSWRHGRTLDGT VTGTAAYLIRGRQGIHSSFQQQSPTESRDKRPGPGKETGTGLFGASRREGLAIPPPSGFR KCPAC >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_3|378_bp atgtggcggggcagtggcccagtgacatcaaggattaggagacaggccacgtctaaggcg gccttgaatatcgaggattctgtggaaatcaaaaggacaaacaactcaaatgcctctggt gtcaacgacaaccagcagacatgtcagtcctggaggcatggtcggactctggatgggaca gtcacgggaacagctgcctatctgataaggggccgccagggaattcacagcagtttccaa cagcagagccccacagagagcagggacaagagacctgggcccgggaaggagacagggaca ggactctttggggcaagtaggagagaaggattagcaataccgccccctagtggcttccgc aagtgcccggcctgttag >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_4|88_aa MGNRSIERTWMKLEAIILSRQTQEQKTKRLMYSLISGSRTMRTRGHREGNITHRGLSGVE ECGSSDKFDQFPAVICTATRQCLSYHYE >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_4|267_bp atggggaacagaagtattgaaaggacatggatgaagctggaagccatcatcctcagcaga caaacacaggaacagaaaaccaaacgcctcatgtactcactcataagtggaagtcgaaca atgagaacacgtggacacagggaggggaacatcacacaccggggcctgtcgggggtggag gaatgtggatcttcagacaagtttgatcagttcccagctgtcatctgtacagccacacga cagtgcctttcttaccattatgagtaa >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_5|239_aa MASTAVQLLGFLLSFLGMVGTLITTILPHWRRTAHVGTNILTAVSYLKGLWMECVWHSTG IYQCQIYRSLLALPQDLQAARALMVISCLLSGIACACAVIGMKCTRCAKGTPAKTTFAIL GGTLFILAGLLCMVAVSWTTNDVVQNFYNPLLPSGMKFEIGQALYLGFISSSLSLIGGTL LCLSCQDEAPYRPYQAPPRATTTTANTAPAYQPPAAYKDNRAPSVTSATHSGYRLNDYV >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_5|720_bp atggccagcacggccgtgcagcttctgggcttcctgctcagcttcctgggcatggtgggc acgttgatcaccaccatcctgccgcactggcggaggacagcgcacgtgggcaccaacatc ctcacggccgtgtcctacctgaaagggctctggatggagtgtgtgtggcacagcacaggc atctaccagtgccagatctaccgatccctgctggcgctgccccaagacctccaggctgcc cgcgccctcatggtcatctcctgcctgctctcgggcatagcctgcgcctgcgccgtcatc gggatgaagtgcacgcgctgcgccaagggcacacccgccaagaccacctttgccatcctc ggcggcaccctcttcatcctggccggcctcctgtgcatggtggccgtctcctggaccacc aacgacgtggtgcagaacttctacaacccgctgctgcccagcggcatgaagtttgagatt ggccaggccctgtacctgggcttcatctcctcgtccctctcgctcattggtggcaccctg ctttgcctgtcctgccaggacgaggcaccctacaggccctaccaggccccgcccagggcc accacgaccactgcaaacaccgcacctgcctaccagccaccagctgcctacaaagacaat cgggccccctcagtgacctcggccacgcacagcgggtacaggctgaacgactacgtgtga >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_6|238_aa MADTGSAFTAGLEELSFPCSLKLTLACSVLSFLDYKNLKDQTCRPRHRPHPVQATHRPTQ GQDHLLHWHPHGPSGSEALTGQESQDAYHCLRGKPKGNEKDLVKLAKFLKKEKVNVDMIN LGEEEVNREKLTAFVNTLNGKDGSGSHLVAVPPGPSLADALISSPILAGEGGAMLGLGAS DFEFGVDPSADPQLALALGFLWRSSSSGRMRSPDEQLQPLLLRPGLLQLGLKAQTRSC >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_6|717_bp atggcggacaccggaagtgccttcacagcaggactggaagaactgagcttcccttgctcc ttgaagctgacactggcatgcagtgtcctgagttttctggattataagaacctcaaagat cagacctgcaggcccagacaccggccgcatcctgtccaagctacacaccgtccaacccaa gggcaagatcaccttctacactggcatccacatggcccatctggctctgaagcgctgacg gggcaagaatcacaagacgcatatcattgccttcgtgggaagcccaaaggcaatgagaag gatctggtgaaactggctaaattcctcaagaaggagaaagtaaatgttgacatgataaat ttgggggaagaggaggtgaacagagaaaagctgacagcctttgtaaacacgttgaatggc aaagatggaagcggttctcatctggtggcagtgcctcctgggcccagcttggctgatgct ctcatcagttctccgattttggctggggaagggggtgccatgctgggtctcggtgccagt gactttgaatttggagtagatcccagtgctgatcctcagctggccttggccctcgggttt ctatggaggagcagcagcagcggcaggatgaggagcccagacgagcagctgcagcctctg ctgctgaggccaggattgctacaactgggactgaaggctcagacgaggtcctgctga >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_7|80_aa MLQPCRLLECDLRDTKQHSYHPECQFQTLNGDTGSICCSVSRNPAWPSTKPEEDASSAAA DSIQQQNSKQGGHLPRRLPP >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_7|243_bp atgctccagccctgccggctcctggagtgtgacctgagggacacaaagcagcactcttac caccctgaatgccagttccaaacactgaatggggacacaggcagcatctgctgtagtgtg agccgcaaccctgcctggccctccaccaagcctgaggaggacgcgagctcggctgcggct gacagcattcagcagcagaactccaagcaagggggtcacctgcccaggagactaccccct tag >gi568815577r:36360979_36561695|GENSCAN_predicted_peptide_8|195_aa MLQNEKDVFGVEKEIDEEKERVAPGEDPSEISEQMHCSLRTGFIPGASASPKQKTAKLLT LAVSLLAPRRTCYSQQDLLQSAAPAKKIFLLPFMVRTTVCEMRDEAWPAHIIPSVETKVT PSWMLIAMLTLFSHRPENVDVIVHIIGYGEYSVLDSPGGLVPGAPGTQTQSQKKGLSRYN HNTGLQVPREESRGS >gi568815577r:36360979_36561695|GENSCAN_predicted_CDS_8|585_bp atgctgcaaaatgagaaagatgtatttggtgttgaaaaggaaattgatgaggagaaggag cgtgtggcccccggggaggaccccagtgagatctccgagcagatgcactgttccctgcgg acaggatttattcctggagcctctgcctcccccaagcagaagacagctaaacttctgaca ttggcggtctcactgctagcccctagaaggacctgctacagtcagcaggacctgctacag tcagcagcacctgccaagaagatattcttactcccctttatggtgaggacaactgtgtgt gagatgagggatgaggcatggccggcccacatcattccatctgtggagacaaaagtgact ccatcttggatgctaatcgccatgttgactctgtttagccaccgccctgagaatgttgat gtcatcgtgcatatcattggctatggcgaatatagcgtcctcgacagtcctggaggcctg gtccctggcgctccaggaacacagacccagtctcaaaagaagggcttgtccaggtacaac cacaacacagggttacaggtgccccgtgaggaaagcagaggcagn