GENSCAN 1.0	Date run: 7-Nov-116	Time: 15:55:57

Sequence gi568815592r:106871664_107074646 : 202983 bp : 44.41% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  12217  12275   59  2  2  110   36   111 0.762   5.95
 1.02 PlyA +  12297  12302    6                               1.05

 2.00 Prom +  17552  17591   40                              -4.56
 2.01 Init +  28190  28265   76  1  1   41   81    49 0.099   0.75
 2.02 Intr +  32748  32928  181  1  1   74   58    67 0.082   1.23
 2.03 Intr +  41491  41618  128  1  2   89   96    23 0.014   3.52
 2.04 Intr +  46612  46687   76  2  1   68   74    39 0.120  -0.93
 2.05 Term +  53761  53956  196  1  1   96   36   133 0.119   5.78
 2.06 PlyA +  56184  56189    6                               1.05

 3.02 PlyA -  57556  57551    6                               1.05
 3.01 Sngl -  60383  60063  321  2  0  103   44   294 0.690  22.49
 3.00 Prom -  62244  62205   40                              -5.56

 4.00 Prom +  65029  65068   40                              -2.16
 4.01 Init +  72809  73026  218  1  2   55   15   147 0.537   2.27
 4.02 Intr +  73094  73221  128  2  2   55   95    68 0.515   4.52
 4.03 Term +  87543  87661  119  1  2   63   46    82 0.128   0.20
 4.04 PlyA +  89203  89208    6                               1.05

 5.04 PlyA -  89484  89479    6                               1.05
 5.03 Term - 101073 100913  161  1  2   77   49    44 0.470  -2.50
 5.02 Intr - 102163 102007  157  1  1   43  102   107 0.943   7.18
 5.01 Init - 102983 102915   69  2  0   96   94   197 0.930  20.15
 5.00 Prom - 103105 103066   40                              -4.06

 6.00 Prom + 108867 108906   40                              -3.86
 6.01 Init + 140433 140621  189  2  0   60   62   263 0.930  19.91
 6.02 Intr + 142424 142546  123  1  0  102   77    51 0.753   6.18
 6.03 Intr + 156430 156608  179  0  2   43  109    16 0.041  -2.08
 6.04 Intr + 168346 168567  222  1  0   31   87   174 0.161   8.84
 6.05 Intr + 172597 172669   73  1  1   75   93    28 0.936   1.41
 6.06 Term + 179394 179573  180  2  0  120   36   129 0.960   8.51
 6.07 PlyA + 179653 179658    6                               1.05

 7.02 PlyA - 179934 179929    6                               1.05
 7.01 Sngl - 199272 197041 2232  0  0   85   49  4683 0.935 457.75

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_1|19_aa
XPEKPVAEKLLAQLVLISP
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_1|60_bp
nggcctgagaaacctgtggctgagaagctgctggcccagctggtcctcatcagcccataa
>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_2|218_aa
MDNRKLLLTVLEAEKSKIKAPDSVSATPPQTMTTSTFQTLKPKPVIFSMTSAFIPTSNSS
GNPVGPTLQIYPESDHFSPPPGLLPIILSSNTTFSKCLKQKGSVCLLWEQQLIMGWGSLK
EEGKPEEQAIAERSSCQCQKETGGLGNMLAQRSRKSLLTPNTEEEQGEEEEEKEKEKEKE
KEWRRRRKKEKGRRNSSGRKRKKGKGEDILKINWILLA
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_2|657_bp
atggacaacagaaagttattgctcacagttctagaggctgagaagtccaagatcaaggca
ccagattcagtgtctgccacccctcctcagacgatgacaacttcaaccttccagacactt
aagccaaaacccgtgatattctccatgacatctgccttcattccaacatcaaattcatca
ggaaatcctgtaggccctaccttgcaaatatatccagaatctgaccacttctcaccacca
ccagggctcctccccattatactttccagtaacacaaccttttcaaaatgcctaaagcaa
aaaggatctgtgtgtttgctttgggaacaacagttaatcatgggctggggcagcctgaaa
gaagaaggaaaacctgaggaacaggccattgcggagagaagcagctgccagtgtcaaaag
gaaacaggtggccttgggaacatgcttgctcagaggagtcggaaaagtttgctaactcct
aatacagaagaagaacaaggagaagaggaggaggaaaaagagaaggaaaaagagaaagaa
aaagaatggaggagaagaaggaagaaagaaaaaggaaggaggaatagctcagggaggaag
agaaagaagggaaaaggagaagacattctgaaaataaattggattctcttggcataa
>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_3|106_aa
MPKRKAEGDAKGDKAKGKDEPQGRSARLSAKPAPPKPEPKPKKAPAKKGEKVPKGKKGKV
DAGKEGNSPAENGDAKTDQAQKAEGAGDAKRSVWIFDNCTYGDCTV
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_3|321_bp
atgcccaagagaaaggctgaaggggatgctaaaggggataaagccaaagggaaggatgaa
ccacagggaagatctgcgaggttgtctgctaaacctgctcctccaaagccagagcccaag
cctaaaaaggcccctgcaaagaagggagagaaggtacccaaagggaaaaaggggaaagtt
gatgctggcaaggaggggaatagccctgcagaaaatggagatgccaaaacagaccaggca
cagaaagctgaaggtgctggagatgccaagcgaagtgtgtggatttttgataactgtacg
tatggtgactgtacagtttga
>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_4|154_aa
MVDAARFEFLHMEMEAELWDHKPNPVHGGTEDKPVDPGEHGFLCEPGSRRETAPEDARLR
EELGVLKFICKNLTTASTPHPDDLGPGFTCGLLHGTLSTLGMESLVTASVASLTTSNRRA
KLQDLVKENPPNSHRGNANAVTKSKEPPDMGFVG
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_4|465_bp
atggtggatgctgcacggttcgagtttctacacatggagatggaagcagagctgtgggat
cacaaacccaaccctgtccatggcgggacagaagataagcctgttgatcctggagagcat
ggatttctgtgtgagccaggttctaggcgagagactgcccctgaagatgctcgtctcagg
gaggagctgggtgtcctcaagttcatatgcaaaaatctgacaacagcttccactcctcat
cccgatgatctggggcctggattcacttgtggcctcctgcatggcaccctcagtaccctg
ggcatggagagtttggtcactgcctctgtggcatccctgaccacctcgaacagaagagcc
aaactccaggatcttgtcaaagagaaccctccaaactctcatagaggaaatgctaatgca
gtgaccaagagcaaggagccacccgacatgggcttcgtagggtga
>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_5|128_aa
MGRAMVARLGLGLLLLALLLPTQRGLPRSSIRGTSPRVRVPGAAQRCLEGVWMHRAGERD
RGLGLGDPSGGGGEEVRAENSDLPLFHDKGEIITCPTGLCISWLEKSVVNDATQTLGHWE
NLGSWPQQ
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_5|387_bp
atgggcagagcaatggtggccaggctcgggctggggctgctgctgctggcactgctccta
cccacgcagcgcggtctccctagatcctccatccggggaacctcgccccgggtgcgggta
cccggggccgcgcagcgctgcctcgagggtgtatggatgcaccgcgccggcgagagagac
cgggggctgggcctgggagaccctagcgggggcgggggcgaagaagtgagggctgaaaac
tccgatctgcctttgttccatgacaaaggagagattatcacgtgccctactggactgtgc
atttcgtggcttgaaaaatcagtggtgaatgatgccactcagacgctgggacactgggag
aaccttggttcttggccacagcaatga
>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_6|321_aa
MKKFCKGDESLEDEEHSGRPLEVDNDQLRAIIEGDPLTTTGEVAEELNVNHSTVVQHLKC
IGKATEQYVYVAQGLETPVLYRRVDVFSHCQPTSPLTECSKGSVSSKTRPPAGVAASLPP
WPRAPPDPLSLRRRRRKGRGADRGSLEACSPRGRGEGREGLTYRLKSNIRSTKSTKKSLQ
KVDEEDSDEESHHDEMSEQEEELEDDPTVVKNYKDLEKAVQSFRYDVVLKTGLDIGRNKV
EDAFYKGELRLNEEKLWKKSRTVKVGDTLDLLIGEDKEAGTETVMRILLKKVFEEKTESE
KYRVVLRRWKSLKLPKKRMSK
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_6|966_bp
atgaagaagttttgcaaaggagatgagagccttgaagatgaggagcatagtggccggcca
ttggaagttgacaacgaccaattgagagcaatcatcgaaggtgatcctcttacaactaca
ggagaagttgctgaagaactcaacgtcaaccattctacagttgttcagcatttaaagtgc
attggaaaggccacagagcagtacgtgtatgtggcccaggggttggagactcctgtgtta
tatagacgtgttgatgtgttttctcattgccaacccacttcacccctcacagaatgcagc
aagggcagcgtgtcctccaagacgcggcctccagcaggggtcgctgcttcgctgccgccc
tggcctcgcgccccgcccgacccgctctcactgcgccggcgccggcggaaggggcggggc
gcagataggggtagcctggaggcctgcagtccgcgcggccgcggggagggacgagagggc
ctgacgtacagactcaaaagtaatataaggtctacaaaatctactaaaaagtctctgcaa
aaagtagatgaagaggactctgatgaagaaagccatcatgatgagatgagtgagcaggaa
gaggagcttgaggatgatcctactgtagtcaaaaactataaagacctggaaaaagcagtt
cagtcttttcggtatgatgttgtcctgaagacggggctagatattgggagaaacaaagtg
gaagatgctttctacaaaggtgaactcaggctgaatgaggaaaaattatggaagaaaagc
agaacggtgaaagtgggagatacattggatcttctcattggagaggataaagaagcagga
acagagacagttatgcggattctcttgaaaaaagtgtttgaagagaagactgaaagtgaa
aaatacagagtggtgttacggcggtggaaaagtttaaagttgcctaagaagagaatgtct
aaataa
>gi568815592r:106871664_107074646|GENSCAN_predicted_peptide_7|743_aa
MRNRENSSPCQGNGEQAGRGRSLGNVWPGEEEPCNDATTPSYKKPLYGISHKIMEKKNPP
SGDLLNVYELFEKANASNSPSSLRLLNEPQKRDCGSTGAGTDNDPNIYFLIQKMFYMLNT
LTSNMSQLHSKVDLLSLEVSRIKKQVSPTEMVAKFQPPPEYQLTAAELKQIVDQSLSGGD
LACRLLVQLFPELFSDVDFSRGCSACGFAAKRKLESLHLQLIRNYVEVYYPSVKDTAVWQ
AECLPQLNDFFSRFWAQREMEDSQPSGQVASFFEAEQVDPGHFLDNKDQEEALSLDRSST
IASDHVVDTQDLTEFLDEASSPGEFAVFLLHRLFPELFDHRKLGEQYSCYGDGGKQELDP
QRLQIIRNYTEIYFPDMQEEEAWLQQCAQRINDELEGLGLDAGSEGDPPRDDCYDSSSLP
DDISVVKVEDSFEGERPGRRSKKIWLVPIDFDKLEIPQPDFEVPGADCLLSKEQLRSIYE
SSLSIGNFASRLLVHLFPELFTHENLRKQYNCSGSLGKKQLDPSRIKLIRHYVQLLYPRA
KNDRVWTLEFVGKLDERCRRRDTEQRRSYQQQRKVHVPGPECRDLTSYAINPERFREEFE
GPPLPPERSSKDFCKIPLDELVVPSPDFPVPSPYLLSDKEVREIVQQSLSVGNFAARLLV
RLFPELFTAENLRLQYNHSGACNKKQLDPTRLRLIRHYVEAVYPVEKMEEVWHYECIPSI
DERCRRPNRKKCDILKKAKKVEK
>gi568815592r:106871664_107074646|GENSCAN_predicted_CDS_7|2232_bp
atgcggaaccgtgagaacagctcgccctgccaaggcaatggtgagcaggccggcaggggc
aggagcctgggcaatgtgtggcctggagaggaggagccctgcaacgatgccaccacccct
tcctacaagaagcctctgtatggcatctcgcacaagatcatggagaagaagaatcctccc
tcgggggacctgctaaacgtgtacgagctctttgagaaggcaaacgccagcaacagcccc
tcgtcactgcggctcctgaatgagccacagaagcgggactgtggcagcaccggggcaggc
actgacaacgaccccaacatctacttcctgatccagaagatgttctacatgctgaacacc
ctcacgtccaacatgtcccagctgcacagcaaggtggacctgctctcccttgaggtgagc
cgcatcaagaagcaggtgagccccactgagatggtggccaaattccagccgccccctgag
taccagctcacagccgcagagctcaagcagatcgtggaccagagcctgtcagggggggac
ctggcctgccgcttgctggtgcagctcttccccgagctcttcagcgacgtggacttctcc
cggggctgcagtgcctgtggctttgcggccaagcgcaagctggagtcgctgcacctgcag
ctcatccgcaactatgtggaggtctactacccctcggtgaaggacacggctgtgtggcag
gccgagtgcctgccccagctgaacgacttcttcagccgcttctgggcccagcgggaaatg
gaggacagccagcccagcggccaggtcgccagcttctttgaggcagagcaggtggacccc
ggccacttcctggacaacaaagaccaggaggaggccctgtctcttgaccggagcagcacc
atcgcctcagaccacgtggtggacacgcaggacctcactgagttcctggacgaagcctcc
tcaccaggcgagtttgccgtcttcctcctccaccggctcttccccgagctcttcgaccac
cgcaagctgggtgaacagtacagctgctacggggacggtggaaagcaggagctggacccg
cagcggctgcagatcatccgcaactacacggagatctacttccctgacatgcaggaggag
gaggcctggctgcagcagtgtgcccagcgcatcaacgacgagctcgagggcctggggctg
gacgcgggcagtgaaggcgaccccccgcgtgatgactgctacgactcctccagtctgccc
gacgacatctcagtggtcaaggtggaggacagcttcgagggcgagcggccgggtcgccgc
tccaagaagatctggctggtgcccatcgacttcgacaagttagagatcccccagcctgac
ttcgaggtgcccggtgccgactgcctgctcagcaaggagcagctacgcagcatctacgag
agcagcctgtccatcggcaacttcgcctcgcgcctgctggtgcacctgttccccgagctc
ttcacgcacgagaacctgcgcaagcagtacaactgcagcggctccctgggcaagaagcag
ctggacccctcccgcatcaagctcatccgccactacgtgcagctgctctacccacgcgcc
aaaaacgaccgcgtctggaccctggagttcgtgggcaaactggatgagcgctgccggcgc
cgggacacggagcaaaggcgctcctaccagcagcagcgcaaggtccacgtgccgggccct
gagtgcagagacttgaccagctatgcaatcaaccccgagaggttccgggaggagtttgag
gggcccccactgccccccgagaggagcagcaaggacttttgcaagatccccttggacgag
ctggtggtcccctcgcctgacttcccggtgccttctccctacctgctgtctgacaaggag
gtgcgtgagatcgtgcagcagagcctctccgtgggcaactttgccgcccggctcctcgtc
cgcctgtttcccgaactcttcaccgccgagaacctccggctgcagtacaaccattccggg
gcttgcaacaagaagcaactggaccccacgcggctgcggctcatccgccactacgtggaa
gccgtctacccggtggagaagatggaggaggtgtggcactacgaatgtatccccagcatc
gatgagaggtgccgccgccccaacaggaaaaaatgcgacatcctcaagaaagcaaagaaa
gtggagaagtga