GENSCAN 1.0	Date run: 5-Nov-116	Time: 18:14:14

Sequence gi568815577r:37525162_37815131 : 289970 bp : 44.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   2013   2152  140  1  2   61   49   133 0.240   4.83
 1.02 PlyA +   5390   5395    6                               1.05

 2.03 PlyA -   7943   7938    6                               1.05
 2.02 Term -   9821   9784   38  0  2   96   44    44 0.560  -1.70
 2.01 Init -  10761  10557  205  0  1   82   88   104 0.917   8.81
 2.00 Prom -  16161  16122   40                              -4.36

 3.13 PlyA -  16775  16770    6                               1.05
 3.12 Term -  16884  16778  107  1  2   32   49    81 0.336  -2.83
 3.11 Intr -  20888  20724  165  0  0  -19   64   137 0.012   0.53
 3.10 Intr -  27049  26939  111  2  0   52   43   155 0.164   7.75
 3.09 Intr -  35474  35318  157  2  1  -21   66   110 0.038  -2.52
 3.08 Intr -  36918  36778  141  0  0   95  -17   110 0.077   1.75
 3.07 Intr -  38441  38170  272  0  2  109   64   167 0.278  13.66
 3.06 Intr -  38792  38636  157  2  1    6   40   109 0.617  -2.52
 3.05 Intr -  39177  38921  257  1  2   -1   55   189 0.045   3.86
 3.04 Intr -  39344  39224  121  2  1   41   60    64 0.018  -1.03
 3.03 Intr -  46318  46106  213  1  0   57  100    45 0.342   1.51
 3.02 Intr -  50230  50169   62  2  2   46   90   115 0.601   5.95
 3.01 Init -  54729  54681   49  0  1   80  103    -5 0.476   1.31
 3.00 Prom -  58855  58816   40                              -5.56

 4.10 PlyA -  59293  59288    6                               1.05
 4.09 Term -  69487  69258  230  2  2  114   47   103 0.595   5.59
 4.08 Intr -  71850  71619  232  0  1   51   93   123 0.470   6.45
 4.07 Intr -  73174  73013  162  1  0   59  -13   130 0.265   0.17
 4.06 Intr -  84883  84812   72  1  0   67   92    33 0.223   1.20
 4.05 Intr -  91075  90885  191  2  2   61   90    32 0.035   0.00
 4.04 Intr -  92535  92434  102  1  0  114   52    51 0.052   4.25
 4.03 Intr -  99825  99667  159  1  0   63   93    41 0.111   2.06
 4.02 Intr - 100323 100036  288  1  0  120   19   323 0.131  25.82
 4.01 Init - 109248 109182   67  0  1   98   57    26 0.363   1.74
 4.00 Prom - 114396 114357   40                              -6.86

 5.00 Prom + 115558 115597   40                              -3.56
 5.01 Init + 117738 117747   10  2  1   71  105    -3 0.281   0.38
 5.02 Intr + 121386 121462   77  1  2  112   81    68 0.670   7.73
 5.03 Intr + 129314 129368   55  1  1  102   37    30 0.218  -2.15
 5.04 Intr + 132655 132761  107  2  2   55   89    70 0.753   3.73
 5.05 Term + 134619 134729  111  2  0   96   44    65 0.804   1.46
 5.06 PlyA + 136336 136341    6                               1.05

 6.06 PlyA - 136931 136926    6                               1.05
 6.05 Term - 142534 142300  235  0  1   -8   41   374 0.600  18.89
 6.04 Intr - 150427 150139  289  2  1    8    4   223 0.072   2.20
 6.03 Intr - 150926 150816  111  0  0   60   72    66 0.446   2.55
 6.02 Intr - 185255 185163   93  0  0   78   88    18 0.187   0.74
 6.01 Init - 189947 189050  898  2  1   66  110   661 0.737  60.56
 6.00 Prom - 200577 200538   40                              -5.36

 7.00 Prom + 202816 202855   40                              -4.56
 7.01 Init + 216904 216953   50  0  2   99   73    69 0.658   6.93
 7.02 Term + 234782 234908  127  2  1   85   43   159 0.163   8.86
 7.03 PlyA + 236824 236829    6                               1.05

 8.06 PlyA - 237391 237386    6                              -0.45
 8.05 Term - 237971 237857  115  1  1   70   42    44 0.131  -3.96
 8.04 Intr - 238253 238051  203  2  2   46   81   111 0.318   4.28
 8.03 Intr - 242056 241964   93  1  0  104   66    65 0.714   6.06
 8.02 Intr - 250941 249877 1065  0  0   20   13   397 0.064  15.88
 8.01 Init - 252155 251766  390  2  0   72   41   221 0.156  12.58
 8.00 Prom - 272351 272312   40                              -3.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_1|46_aa XQPLLLKVLGPVGGRHLGENHQEDGYVPSSGRWQSRDEFAGLQLVC >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_1|141_bp ncacagcctctgcttcttaaagtcctggggcctgtaggaggaaggcaccttggtgagaac caccaagaggatggctacgtgcccagcagcgggagatggcagtccagagatgagtttgca ggcctgcagctcgtctgctga >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_2|80_aa MGHDCHRCCAKGSRMQEAICPLRKPLLPPLPPSAQVLLSSVALSYGSQMPLVALLLQGTK SSQTIKRGAPCDDTGSIQIT >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_2|243_bp atgggccacgactgtcacagatgctgtgccaaggggtcccggatgcaggaagccatctgc cctctgcgcaaacctctcctgcccccactgccaccttctgcccaagtcctcctctcctcc gtggctctttcttacggctcccagatgccacttgtagctcttctacttcagggcaccaag tcaagccagactataaaaaggggagccccttgtgatgacactggctccatccagataacc tag >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_3|603_aa MDEPMSGEKTGPFHSCGADESSYVTKTPKGGGLESGKISKDNCTFKELKKAQSKEIKEST RPMSHQIQNMNKETEIKKLSKYISINFGAEKYSNQNEKFTSGSTADLKTHRNDDSISLPP SARKKITNASNCESPQRFRCRSTWGVHGGVKSKLGMGLDVVERGHGLCKDLPRRVPSRHH DVTGSKDVHPRSKPVFLRSNFGALGHSLEGLMRDGCPASCWGPKDAGREAQQAKGVAREL PVNRRRGRTVTWSLRTRTGTVTWPTSGLRSFALGPSEASAAPWNPSGKLEARENVPAGNL RKSDGSSACQELASCSQIHVCFLQLPCGAGYLVPALEMRKQAQMALKTFPETAQSQAFPT RTPYSLLLSDHPRATLQKKGQIQKSRAPPLSFEKCYHENEAYRPKEIKSILAPHYQYCQM LVQGDDPYENGAPRVAQCLARTTVPGDLEWKAHLDTLHGSVTQAIDEAEAPQALDSSAPE LEALAKLQKLAIDDQVLAIWGQQLQSSRTGYPAAQPYAGMLLSHSLNTGRSSHRPQALAV SLRRSSLRTPSTLWTYIAILEKEHPQQKDPTTNLLKNQKKKKHVKHHHKDAISKTQTLGD STE >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_3|1812_bp atggatgagcccatgtcaggtgagaaaactggtcccttccattcctgtggggctgatgaa agcagctacgtgaccaagacaccaaaaggaggtggcctggagagtggaaagatatcaaag gacaactgtacgttcaaagaattaaagaaagcccaatctaaagaaataaaggaaagtacg agacccatgtctcaccaaatacagaacatgaataaagagacagaaattaagaaattgtca aaatatatatctataaattttggagctgaaaagtacagtaaccaaaatgaaaaattcacc 
agtggctcaacagcagatttgaagacacacaggaacgacgacagcatcagcctgccacct tctgccaggaagaagataaccaatgctagcaactgtgagtcaccacagaggtttcgctgc agatccacctggggggtgcacgggggagtgaaaagcaagttgggaatgggcttggatgtc gtggagaggggacatggcctctgcaaggatcttcccagaagggtgccctcaaggcaccac gacgtcacgggcagtaaggacgtgcaccctcggagcaagcctgttttcctccggagcaac ttcggagctttagggcattcgctggagggcctcatgagggacggctgccctgcaagttgc tggggaccaaaggacgcagggcgcgaagctcagcaagcaaagggcgtggccagggagctc ccagtcaatcgacgccgcggacgcacagtcacgtggtcgctgcggacgcggacggggacg gtcacgtggccgacaagcgggctgcgttctttcgccttgggcccctcggaagcgtccgca gccccctggaaccctagcggaaaacttgaggcccgagagaacgttccagcaggaaatctg cggaaatcggatgggagcagtgcatgccaggaactcgccagctgctcccagatacatgtc tgcttcctgcaactaccctgcggggcgggctacctggtccctgccttagagatgaggaaa caggctcagatggcattgaaaaccttccccgagactgcacaaagccaggcttttcccacc aggacaccctactcgctgcttctgagtgaccatcctcgtgccacacttcagaaaaaaggg cagattcagaaatcaagagcccctccgttaagttttgagaaatgttatcatgaaaatgaa gcatatcggccaaaggaaattaaatccatcttggctccccactaccagtactgccaaatg cttgttcagggagatgacccctatgagaatggtgcccctagagttgctcagtgcctggca cgcactactgtacctggtgacctagagtggaaagcacatttggacaccctccacggcagc gtaacacaggccattgatgaggctgaggctccccaagcactagacagcagtgctcctgag ttggaagcactggcaaaacttcagaagctggccatcgatgaccaagtcctggccatttgg ggccaacagctgcagagctcgcggactggctacccagcagcccagccatatgcgggaatg ctgctgtctcactctctgaacactggtcggtctagtcaccgaccacaagccctggctgtc tctctgaggcggtcctcactcaggaccccatccaccttgtggacctacatcgccatcctg gagaaggagcatccacaacaaaaagatccaactaccaatcttctaaaaaaccagaagaaa aagaaacatgttaaacaccaccacaaagatgcaatcagcaaaacccagactctaggagat tctacagaataa >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_4|500_aa MAPASGEGFRKLPLIAEGEKGAGMTCQARSSYITSEILWGYRFTPVLTLEDGFYEVDYNS FHETYETSTPSLSAKELAELASRAELPLSWSVSSKLNQHAELETEEEEKNLEEQTERNVI QIDKVDMDLMKVQSALICGPSLVSSQNTTHPTPGDFSYLPACVVQYQITQKGSLTFWYHD AVGTDSVIGPMAELSPDSLTWSLFLNCTLGKLPALVGAWLPRMCLAAMVTPSLSVWTTLP QARGDLAEAPWGGGSLGLLHLQTALGEPKHPPTTSCSPVITTTKPGSREMSCEVPHTEAM NEYEERADHTLFEKCAHAGMKSDRIKSQSVIEMMMNFPAHPTAAGELKGYTRQQFCHDNT 
PNKGDGIIYNLTRPPYCCVRFPLAGREPHILYLSRLASNLELFKTGKGRGEQRKEEVTLR NAEKAPAQGNAHPALLCFLLRIHTSLPRQKVATRKTEPHHIHGIPTRGSQQLPDENTTVR IAIHVIPWCAPMLKPRRAAL >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_4|1503_bp atggccccggcctctggtgagggcttcaggaagcttccactcattgcagaaggggaaaag ggagcagggatgacatgccaagctcgaagctcctacatcaccagtgagatcctgtggggt taccggttcacacctgtcctgaccctggaggacgggttctacgaagttgactacaacagc ttccatgagacctatgagaccagcaccccatcccttagtgccaaagagctggccgagtta gccagcagggcagagctgcccctgagttggtctgtatccagcaaactcaaccaacatgca gaactggagactgaagaggaagaaaagaacctcgaagagcaaacagaaagaaatgttatt cagattgacaaggtagacatggatttgatgaaagtgcaaagtgccctcatttgtggccca agcctggtctcctcccaaaatactacacatccaactcctggagatttcagttacttacct gcatgtgttgtacaataccagatcactcaaaaaggaagtctcacgttttggtatcacgat gccgtgggcacagactcagtcatcgggccaatggcagaactctctccagatagtctgacc tggtccctctttttgaactgtactctggggaaactcccagcacttgtgggtgcctggtta ccgaggatgtgcttggctgccatggtaactcccagcctctcagtgtggaccactctcccg caggctcgaggagacctggcagaggccccatggggcggaggctccctagggctgctccac ctgcagacagccttgggagaacccaagcacccaccgaccacgtcctgctcccctgtcatt accaccactaaacccggtagcagagagatgagctgtgaggttcctcacacagaagccatg aatgaatacgaggaacgggccgaccacacactgtttgaaaaatgtgcccatgcaggcatg aaatcagacagaatcaaatctcaatcagtcatagaaatgatgatgaacttcccagctcat cccactgctgctggagagttgaaagggtacactcgccagcagttttgccacgacaataca ccaaacaaaggagacgggatcatttataacctgacgcgtccaccctactgctgtgtccgg tttccactggctggaagggaacctcacattctgtatttgtctcgattggctagcaactta gaactttttaaaacaggcaaaggcagaggagaacagaggaaggaggaagtaactttgcgg aatgctgagaaagcacctgcccagggcaacgcccacccggccctcctctgcttccttctc cgaattcacacatcacttccccgccaaaaagtagcaacaaggaagacagaaccacatcat attcacggcattccaacgagagggtcacagcagctccctgatgaaaacacaactgtcagg atcgctattcacgtcatcccttggtgtgcacccatgctgaagccacgaagagccgcccta tga >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_5|119_aa MPSGRVGMFQFQQPVCRAMFSRAREQEQQFTLTAPNTAVLGETAPVREKSLLHEGNFLGE TASEKSVFHDGVFPVMVLQAGRVDSTLSTFGFFMPSPMSSDSKPFIKLPITIISGNMAV >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_5|360_bp 
atgccctctgggagggttggcatgttccagttccagcagccggtgtgccgggcgatgttc agcagagccagagagcaggaacagcagtttactctcacagccccaaatacagcagtttta ggagaaacagcaccagtcagggaaaagtccctgctccatgaagggaattttctgggggag acggcatctgagaaatcagtatttcacgacggtgtctttccagtgatggttctgcaagct ggtagagtggattcaaccttgtccacttttggcttcttcatgccctctccaatgagcagt gattcaaaaccttttataaaactacccatcactatcatctctggcaacatggctgtctag >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_6|541_aa MDQDVESPVAIHQPKLPKQARDDLPRHISRDRTKRKIQRYVRKDGKCNVHHGNVRETYRY LTDIFTTLVDLKWRFNLLIFVMVYTVTWLFFGMIWWLIAYIRGDMDHIEDPSWTPCVTNL NGFVSAFLFSIETETTIGYGYRVITDKCPEGIILLLIQSVLGSIVNAFMVGCMFVKISQP KKRAETLVFSTHAVISMRDGKLCLMFRVGDLRNSHIVEASIRAKLIKSKQTSEGEFIPLN QTDINVGYYTGDDRLFLVSPLIISHEINQQSPFWEISKAQLPKEELEIVVILEGMVEATV IINSGSNLLPLYPHSLAAHQNHIILICKYFALLQFCGAHESPTDPVKMQAQIQEVWVLHL QPAGKGAGRCGGAQGLGVIAAAPPRGAGDRGAEAQPEGPRDDSGAHPQRGQRTRNPHRTC SPERRSERVRHLAVRRLPSPAQFLKANAATRGKIPANLIPSSLSADPGARYPGPATRRLL PWTCYPGACYPRPRYPGSCYPSTHYPGACYPGARYPGPRYPGAHYPSVLYPSSRYPGAYY P >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_6|1626_bp atggatcaggacgtcgaaagcccagtggccattcaccagccaaagttgcctaagcaggcc agggatgacctgccaagacacatcagccgagatcggaccaaaaggaaaatccagaggtac gtgaggaaagacggaaagtgcaatgttcatcacggcaacgtgagggagacctatcgctac ctgaccgatatcttcaccacattagtggacctgaagtggagattcaacctattgattttt gtcatggtttacacagtgacctggctcttttttggaatgatctggtggttgatcgcatac atacggggagacatggaccacatagaggacccctcctggactccttgtgttaccaacctc aacgggttcgtctctgcttttttattctcaatagagacagaaaccaccattggttatggc taccgggtcatcacagataaatgcccagagggaattattcttctcttaatccaatctgtg ttggggtccattgtcaatgcattcatggtgggatgcatgtttgtaaaaatctctcaaccc aagaagagggcagagaccctggtcttttccacccatgcagtgatctccatgcgggatggg aaactgtgcctgatgttccgggtaggggaccttaggaattcccacattgtggaggcttcc atcagagccaagttgatcaaatccaaacagacctcggagggggagttcatcccgttgaac cagacggatatcaacgtagggtattacacgggggatgaccgtctgtttctggtgtcaccg ctgatcattagccatgaaattaaccaacagagtcctttctgggagatctccaaagcccag ctgcccaaagaggaactggaaattgtggtcatcctagaaggaatggtggaagccacagtg 
attatcaattcagggtcaaacttgcttcctctgtatccccactcattggctgcccaccag aatcacatcatcctcatctgcaaatatttcgcactcctgcagttctgtggtgcccacgaa tctcccacggaccctgttaaaatgcaggctcagattcaggaggtctgggttctgcatttg caacctgctggcaagggtgcagggcgatgcggcggcgcccagggactgggtgtcatcgca gcagctcctcccaggggtgctggggaccgaggggcagaggcgcagcccgagggtccccgc gatgacagtggtgctcatccccagcgcggtcagcgcaccagaaacccgcaccgtacctgc agccctgaaagacgcagcgagcgcgtgcgccacctagcggtgcgtcggctgcccagcccg gcgcagtttcttaaagccaatgcagccactcgcggaaaaatccccgccaacttgattcct tcctctctgagtgccgaccccggagcccgctaccccggacctgctacccggcgcctgcta ccctggacctgctaccccggtgcctgctaccccagaccccgctaccctggctcctgctac cccagcacccactaccctggagcctgctaccccggcgcccgctaccccggaccccgctac cccggcgcccactaccccagcgtcctctaccccagctcccgctaccccggcgcctactac ccctga >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_7|58_aa MGKLQVSYLEMDILCDWGSQMLHRDDTQAVLPRGPCQKERMPLTHRQHQLASHGYELL >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_7|177_bp atggggaagctgcaggtgagctacctagaaatggacatcctgtgtgattggggcagccag atgctacatcgtgatgacactcaagcagtcctacccagaggcccatgtcagaaggaacga atgcctctcacccacaggcagcaccaactcgccagccacggctacgagctgctttag >gi568815577r:37525162_37815131|GENSCAN_predicted_peptide_8|621_aa MPLFDKGCGRHANKIDRPLARLIKKKREKNQIAAIKNDKGDITTNPTEIQTTIREYYKHL YANKLENLEEMDKFLDTYALPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTA KFYQRYEEELRIKYLGIQLTRDVKDLFKENYKPLLNEIKDDTNKWKNIPCSWVGKINIVK MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKAILSQKNKVGGITLPD FKLYYKATVTKTAWYWYQNRNIDQRNRTEPSEIMPHIYNYLIFDKPDKNRKWGKDSLFNK WCWENWLAICRKLKLDPFVTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMATKARIDKWVLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYSPDKGLISRIY KELKQIYKKKTNNPINKWAKDMNRYFSKEDIYAAKRHMKKCSSSLATREMQIKTIMRYHL TPVRMTKMCARQSLKLSDNGTQNNSNPMYSVPNTNADQEQEVREEKGRRGDGVVPAVQGI TSLPREDRSSHPPKIPPLPSSPGWGEKVSAGPPYKQAAIQRGTRNFGKFLLTCHHQMPKS VGPDESGFLGSPGSHWWLRVI >gi568815577r:37525162_37815131|GENSCAN_predicted_CDS_8|1866_bp atgcccctgtttgataaagggtgcgggaggcatgctaacaaaattgatagaccgctagca agactaataaagaagaaaagggagaagaatcaaatagctgcaataaaaaatgataaaggg 
gatatcaccaccaatcccacagaaatacaaactaccatcagagaatactataaacacctc tacgcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacgccctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gaggcaataattaatagcttaccaaccaaaaaaagtccaggaccagacggattcacagcc aaattctaccagaggtacgaggaggagctgagaataaaatacctaggaatccagcttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagac gatacaaacaaatggaagaacattccatgctcatgggtaggaaaaatcaatatcgtgaaa atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcgccaaggcaatcctaagccaaaagaacaaagttggaggcatcacgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga aatatagaccaacggaacagaacagagccctcagaaataatgccacatatctacaactat ctgatctttgacaaacctgataaaaacaggaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactagatcccttcgttaca ccttacacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaactctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccagaattgacaaatgggttctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaataggcaacctaca gaatgggagaaaatttttgcaatctactcacctgacaaagggctaatatccagaatctac aaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaacaagtgggcaaag gatatgaacagatacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaa tgctcatcttccctggccaccagagaaatgcaaatcaaaaccataatgagataccatctc acaccagttagaatgacaaagatgtgtgctcgccaatccttgaagctgtcagataatggg acacagaacaattccaatcccatgtacagtgtgcccaacactaatgcggatcaggagcag gaagtaagagaggagaaaggaagacgaggtgatggtgttgtcccagctgttcagggaatc acctccttaccacgagaagacagatcatcgcatccaccaaaaattccacccttgccttca agtcctgggtggggtgagaaggtgtctgcaggccccccctacaagcaggctgccatccag cgtgggacaagaaactttggaaaattcctcctcacatgccatcatcagatgcccaaatca gtgggtcctgatgaatctggatttctcgggagcccgggcagtcactggtggctgcgtgtc atttag