GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:23:17 Sequence gi568815597r:205618283_205829624 : 211342 bp : 45.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 2556 1828 729 0 0 80 33 413 0.350 26.63 1.04 Intr - 5609 5394 216 2 0 91 62 182 0.985 14.60 1.03 Intr - 7882 7704 179 2 2 14 -1 182 0.504 1.54 1.02 Intr - 12913 12810 104 0 2 99 53 68 0.854 4.22 1.01 Init - 13533 13427 107 0 2 69 47 196 0.998 11.29 1.00 Prom - 18802 18763 40 -2.06 2.05 PlyA - 19365 19360 6 1.05 2.04 Term - 41389 40952 438 1 0 97 43 321 0.984 23.78 2.03 Intr - 43844 43579 266 0 2 135 78 355 0.996 36.43 2.02 Intr - 45336 44551 786 1 0 133 116 850 0.953 83.75 2.01 Init - 46374 46203 172 0 1 76 92 277 0.999 24.50 2.00 Prom - 55444 55405 40 -1.96 3.06 PlyA - 56368 56363 6 1.05 3.05 Term - 60512 60379 134 0 2 110 40 103 0.808 5.95 3.04 Intr - 69196 69104 93 2 0 80 81 67 0.459 5.14 3.03 Intr - 72076 72032 45 2 0 122 88 -2 0.317 1.58 3.02 Intr - 75063 75031 33 1 0 122 92 13 0.162 3.29 3.01 Init - 85679 85667 13 2 1 81 61 1 0.020 -2.86 3.00 Prom - 90297 90258 40 -2.36 4.07 PlyA - 91661 91656 6 1.05 4.06 Term - 100197 99998 200 1 2 91 42 163 0.997 9.56 4.05 Intr - 101394 101245 150 1 0 70 91 169 0.996 15.53 4.04 Intr - 102371 102219 153 0 0 50 36 209 0.978 11.94 4.03 Intr - 105699 105644 56 2 2 73 67 82 0.917 3.22 4.02 Intr - 109551 109418 134 0 2 -14 82 142 0.798 2.84 4.01 Init - 111295 111290 6 1 0 61 111 0 0.609 0.57 4.00 Prom - 113829 113790 40 -4.36 5.00 Prom + 116968 117007 40 -2.66 5.01 Init + 130751 130844 94 1 1 65 80 63 0.782 3.74 5.02 Intr + 131168 131268 101 0 2 87 81 9 0.630 -0.07 5.03 Term + 131682 132113 432 2 0 40 54 182 0.561 5.30 5.04 PlyA + 135774 135779 6 1.05 6.06 PlyA - 136241 136236 6 1.05 6.05 Term - 152171 152060 112 1 1 85 39 89 0.986 1.73 6.04 Intr - 152572 152451 122 1 2 81 107 41 0.591 4.59 6.03 Intr - 153371 153190 182 0 2 116 95 193 0.999 22.39 6.02 Intr - 154285 154214 72 2 0 19 103 103 0.934 4.28 6.01 Init - 156674 156551 124 2 1 86 78 198 0.970 19.18 6.00 Prom - 160023 159984 40 -1.46 7.12 PlyA - 160030 160025 6 1.05 7.11 Term - 173436 173251 186 0 0 92 33 161 0.991 8.49 7.10 Intr - 176736 176588 149 1 2 110 87 168 0.940 18.85 7.09 Intr - 177196 177062 135 2 0 98 57 168 0.983 15.24 7.08 Intr - 178721 178642 80 1 2 79 77 41 0.964 1.29 7.07 Intr - 179769 179622 148 1 1 95 96 149 0.999 15.69 7.06 Intr - 180533 180387 147 0 0 101 35 259 0.814 22.11 7.05 Intr - 180819 180675 145 0 1 106 86 243 0.971 25.76 7.04 Intr - 181548 181477 72 0 0 108 110 89 0.997 12.70 7.03 Intr - 182778 182671 108 0 0 61 73 111 0.990 7.38 7.02 Intr - 192083 191788 296 0 2 117 91 339 0.979 33.83 7.01 Init - 195132 194838 295 0 1 64 51 244 0.931 13.65 7.00 Prom - 207035 206996 40 -2.16 8.02 PlyA - 209763 209758 6 1.05 8.01 Term - 210461 210338 124 1 1 86 51 135 0.748 7.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 52302 52394 93 2 0 109 72 61 0.912 6.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_1|445_aa MRAAAGTRRARLGRRRLLRLRRGGRSGCRAVLEFPASLVIRCRVSFLFLFRSSGPPPLFT FRLVHSCIEAGVAASQTVDTLVTVGNVEKEVFMVFLVELTHCGTGGQNDIVDKEKQGILS SEMNSLLDQELIAMDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGI RKNKPNMNYDKLSRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCES LNFSEVSSSSKDVENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIK TENPAEKLAEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALET LVSPKLPSLEAPTSASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVAS QPMELPENLSLEPKDQDSVLLEKDK >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_1|1335_bp atgcgggcggctgcgggcacccggcgggctcggcttggccgccgccgccttctacggctc cgccgcgggggtcgcagcggctgccgcgccgtcctcgagtttccagcctcgctagtcatc cggtgtcgagtttcatttctttttctctttcgctccagcggccccccgcccctctttact ttccgccttgtgcacagctgcatcgaggctggggtagcagcatcacagactgtagacact ttggtcactgtaggcaacgtagagaaagaagtcttcatggtgttcctggtagagctgacc cattgtggcactggtgggcagaatgacattgttgacaaagaaaaacaaggcatcctcagc tcggagatgaattcgcttctggatcaagaactcattgctatggacagtgctatcaccctg tggcagttccttcttcagctcctgcagaagcctcagaacaagcacatgatctgttggacc tctaatgatgggcagtttaagcttttgcaggcagaagaggtggctcgtctctgggggatt cgcaagaacaagcctaacatgaattatgacaaactcagccgagccctcagatactattat gtaaagaatatcatcaaaaaagtgaatggtcagaagtttgtgtacaagtttgtctcttat ccagagattttgaacatggatccaatgacagtgggcaggattgagggtgactgtgaaagt ttaaacttcagtgaagtcagcagcagttccaaagatgtggagaatggagggaaagataaa ccacctcagcctggtgccaagacctctagccgcaatgactacatacactctggcttatat tcttcatttactctcaactctttgaactcctccaatgtaaagcttttcaaattgataaag actgagaatccagccgagaaactggcagagaaaaaatctcctcaggagcccacaccatct gtcatcaaatttgtcacgacaccttccaaaaagccaccggttgaacctgttgctgccacc atttcaattggcccaagtatttctccatcttcagaagaaactatccaagctttggagaca ttggtttccccaaaactgccttccctggaagccccaacctctgcctctaacgtaatgact gcttttgccaccacaccacccatttcgtccataccccctttgcaggaacctcccagaaca ccttcaccaccactgagttctcacccagacatcgacacagacattgattcagtggcttct cagccaatggaacttccagagaatttgtcactggagcctaaagaccaggattcagtcttg ctagaaaaggacaaa >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_2|553_aa MVQRLWVSRLLRHRKAQLLLVNLLTFGLEVCLAAGITYVPPLLLEVGVEEKFMTMVLGIG PVLGLVCVPLLGSASDHWRGRYGRRRPFIWALSLGILLSLFLIPRAGWLAGLLCPDPRPL ELALLILGVGLLDFCGQVCFTPLEALLSDLFRDPDHCRQAYSVYAFMISLGGCLGYLLPA IDWDTSALAPYLGTQEECLFGLLTLIFLTCVAATLLVAEEAALGPTEPAEGLSAPSLSPH CCPCRARLAFRNLGALLPRLHQLCCRMPRTLRRLFVAELCSWMALMTFTLFYTDFVGEGL YQGVPRAEPGTEARRHYDEGVRMGSLGLFLQCAISLVFSLVMDRLVQRFGTRAVYLASVA AFPVAAGATCLSHSVAVVTASAALTGFTFSALQILPYTLASLYHREKQVFLPKYRGDTGG ASSEDSLMTSFLPGPKPGAPFPNGHVGAGGSGLLPPPPALCGASACDVSVRVVVGEPTEA RVVPGRGICLDLAILDSAFLLSQVAPSLFMGSIVQLSQSVTAYMVSAAGLGLVAIYFATQ VVFDKSDLAKYSA >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_2|1662_bp atggtccagaggctgtgggtgagccgcctgctgcggcaccggaaagcccagctcttgctg gtcaacctgctaacctttggcctggaggtgtgtttggccgcaggcatcacctatgtgccg cctctgctgctggaagtgggggtagaggagaagttcatgaccatggtgctgggcattggt ccagtgctgggcctggtctgtgtcccgctcctaggctcagccagtgaccactggcgtgga cgctatggccgccgccggcccttcatctgggcactgtccttgggcatcctgctgagcctc tttctcatcccaagggccggctggctagcagggctgctgtgcccggatcccaggcccctg gagctggcactgctcatcctgggcgtggggctgctggacttctgtggccaggtgtgcttc actccactggaggccctgctctctgacctcttccgggacccggaccactgtcgccaggcc tactctgtctatgccttcatgatcagtcttgggggctgcctgggctacctcctgcctgcc attgactgggacaccagtgccctggccccctacctgggcacccaggaggagtgcctcttt ggcctgctcaccctcatcttcctcacctgcgtagcagccacactgctggtggctgaggag gcagcgctgggccccaccgagccagcagaagggctgtcggccccctccttgtcgccccac tgctgtccatgccgggcccgcttggctttccggaacctgggcgccctgcttccccggctg caccagctgtgctgccgcatgccccgcaccctgcgccggctcttcgtggctgagctgtgc agctggatggcactcatgaccttcacgctgttttacacggatttcgtgggcgaggggctg taccagggcgtgcccagagctgagccgggcaccgaggcccggagacactatgatgaaggc gttcggatgggcagcctggggctgttcctgcagtgcgccatctccctggtcttctctctg gtcatggaccggctggtgcagcgattcggcactcgagcagtctatttggccagtgtggca gctttccctgtggctgccggtgccacatgcctgtcccacagtgtggccgtggtgacagct tcagccgccctcaccgggttcaccttctcagccctgcagatcctgccctacacactggcc tccctctaccaccgggagaagcaggtgttcctgcccaaataccgaggggacactggaggt gctagcagtgaggacagcctgatgaccagcttcctgccaggccctaagcctggagctccc ttccctaatggacacgtgggtgctggaggcagtggcctgctcccacctccacccgcgctc tgcggggcctctgcctgtgatgtctccgtacgtgtggtggtgggtgagcccaccgaggcc agggtggttccgggccggggcatctgcctggacctcgccatcctggatagtgccttcctg ctgtcccaggtggccccatccctgtttatgggctccattgtccagctcagccagtctgtc actgcctatatggtgtctgccgcaggcctgggtctggtcgccatttactttgctacacag gtagtatttgacaagagcgacttggccaaatactcagcgtag >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_3|105_aa MEMIGKKSPVLPDFSGLDGLCKMVLCLGARGATLKPEVTSCVVKGSFVSNSKALPSCDDS QGLSWSQDSVVLKVLRKCSLSQQNEFEDKNTAVDECTAVDRQAPI >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_3|318_bp atggaaatgatagggaagaaaagtcctgttttaccagacttctcaggcctggatggattg tgcaaaatggtactatgccttggagcaagaggggcaacactcaaacctgaggtcaccagc tgtgtagtgaagggctcatttgtcagcaactccaaggctctgccatcttgtgatgacagt caaggcctgtcgtggagccaggactctgttgttctgaaagtgctcagaaaatgcagcttg agtcagcagaatgagtttgaagataaaaatacagctgtggatgaatgtacagctgtggac agacaggccccaatttag >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_4|232_aa MQLIKFEKHCLDEDYGRDSGPPTKKIRSSPREAKNKRRSGKNSQEDSEDSEDKDVKTKKD DSHSAEDSEDEKEDHKNVRQQRQAASKAASKQREMLMEDVGSEEEQEEEDEAPFQEKDSG SDEDFLMEDDDDSDYGSSKKKNKKMVKKSKPERKEKKMPKPRLKATVTPSPVKGKGKVGR PTASKASKEKTPSPKEEDEEPESPPEKKTSTSPPPEKSGDEGSEDEAPSGED >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_4|699_bp atgcagttaattaagtttgaaaaacactgtttagatgaagattatggaagagattcgggc cctcccactaagaaaattcgatcatctccccgagaagctaaaaataagaggcgatctgga aagaattcacaggaagatagtgaggactcagaagacaaagatgtgaagaccaagaaggat gattctcactcagcagaggatagtgaagatgaaaaagaagatcataaaaatgtgcgccaa caacggcaggcggcatctaaagcagcttctaaacagagagagatgctcatggaagatgtg ggcagtgaggaagaacaagaagaggaggatgaggcaccattccaggagaaagattccggc agcgatgaagatttcctaatggaagatgatgacgatagtgactatggcagttcgaaaaag aaaaacaaaaagatggttaagaagtccaaacctgaaagaaaagaaaagaaaatgcccaaa cccagactaaaggctacagtgacgccaagtccagtgaaaggcaaagggaaagtgggtcgc cccacagcttcaaaggcatcaaaggaaaagactccttctcccaaagaagaagatgaggaa ccggaaagcccgccagaaaagaaaacatctacaagccccccacccgagaaatctggggat gaagggtctgaagatgaagccccttctggggaggattaa >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_5|208_aa MKSDGYPNLALLLLKSLGDKRVTKISRDAYAGTPPSATERAQGPHPRASRWGKGQYGCLR PSSLKAATCSLSKQDRVEKPKTRTPPTPRARRPTPPELQPMGPLLPNPELLASQTPLLFG SGLLEQTSPPLPRLFKMDESNQPKMRQSRRATKPTRVCALLPNTPAHWRLLPDAPRSLVR LNHHVFQAPPCSLIGSTFTVGVLLVKML >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_5|627_bp atgaagtctgatggttacccgaacctagcccttttattactaaagtcactcggggacaag agggttacaaaaatatcccgagacgcgtacgcgggaacacccccgtccgctacagagagg gctcagggcccccacccccgcgccagtcgctgggggaaggggcaatacgggtgcctccgc ccctcctcgctgaaggccgcgacatgttcgctgtcgaaacaggaccgagtcgagaagcca aagaccaggaccccccccaccccgcgcgctcggcgccccaccccccccgaacttcagccg atgggaccgctgctgccgaaccccgagctgctggcttctcaaactccgctgctctttggt tcagggctcctggaacagacgagccccccgctcccccgtctcttcaaaatggatgaatca aaccagccgaaaatgcgccaaagccgccgtgcaaccaaacccactagggtttgtgcgctc ctccccaacacgcctgctcattggagacttctgccagatgcgcccagatcattagtccgt ttgaatcatcacgtcttccaggccccgccctgctctctgataggctccaccttcaccgtg ggggtcctgttagtcaagatgctctga >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_6|203_aa MGSRDHLFKVLVVGDAAVGKTSLVQRYSQDSFSKHYKSTVGVDFALKVLQWSDYEIVRLQ LWDIAGQERFTSMTRLYYRDASACVIMFDVTNATTFSNSQRWKQDLDSKLTLPNGEPVPC LLLANKCDLSPWAVSRDQIDRFSKENGFTGWTETSVKENKNINEAMRVLIEKMMRNSTED IMSLSTQGDYINLQTKSSSWSCC >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_6|612_bp atgggcagccgcgaccacctgttcaaagtgctggtggtgggggacgccgcagtgggcaag acgtcgctggtgcagcgatattcccaggacagcttcagcaaacactacaagtccacggtg ggagtggattttgctctgaaggttctccagtggtctgactacgagatagtgcggcttcag ctgtgggatattgcagggcaggagcgcttcacctctatgacacgattgtattatcgggat gcctctgcctgtgttattatgtttgacgttaccaatgccactaccttcagcaacagccag aggtggaaacaggacctagacagcaagctcacactacccaatggagagccggtgccctgc ctgctcttggccaacaagtgtgatctgtccccttgggcagtgagccgggaccagattgac cggttcagtaaagagaacggtttcacaggttggacagaaacatcagtcaaggagaacaaa aatattaatgaggctatgagagtcctcattgaaaagatgatgagaaattccacagaagat atcatgtctttgtccacccaaggggactacatcaatctacaaaccaagtcctccagctgg tcctgctgctag >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_7|586_aa MGAGTRALQAHGAGSRFPSLLLLRFPGAPPPGHEAKEGGRAEVDPAPSPVPDWLLLLSIR ESIFDGAGGVPPSLPQITFATGSQPVGPESAGKKRLGAHGPGREPLAGTSEFLGPDGAGV EVVIESRANAKGVREEDALLENGSQSNESDDVSTDRGPAPPSPLKETSFSIGLQVLFPFL LAGFGTVAAGMVLDIVQHWEVFQKVTEVFILVPALLGLKGNLEMTLASRLSTAANIGHMD TPKELWRMITGNMALIQVQATVVGFLASIAAVVFGWIPDGHFSIPHAFLLCASSVATAFI ASLVLGMIMIGVIIGSRKIGINPDNVATPIAASLGDLITLALLSGISWGLYLELNHWRYI YPLVCAFFVALLPVWVVLARRSPATREVLYSGWEPVIIAMAISSVGGLILDKTVSDPNFA GMAVFTPVINGVGGNLVAVQASRISTFLHMNGMPGENSEQAPRRCPSPCTTFFSPDVNSR SARVLFLLVVPGHLVFLYTISCMQGGHTTLTLIFIIFYMTAALLQVLILLYIADWMVHWM WGRGLDPDNFSIPYLTALGDLLGTGLLALSFHVLWLIGDRDTDVGD >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_7|1761_bp atgggcgctgggacccgcgcgctccaggcgcatggagccggctcccggttcccgtcactc ctcctactgcgtttccccggcgccccgcctcctgggcacgaagcgaaggaagggggccgg gccgaggttgatcccgccccctccccagtccctgattggctgctgctgttgtccatccga gaatctatttttgatggagcggggggggtgccacccagtctgccccagatcacgtttgcc accggcagccaaccagttgggcccgagtccgcgggcaagaagcgattgggggcgcatggc ccagggagagagcccttggctgggacctcagagttcctggggcctgatggggctggggta gaggtggtgattgagtctcgggccaacgccaagggggttcgggaggaggacgccctgctg gagaacgggagccagagcaacgaaagtgacgacgtcagcacagaccgtggccctgcgcca ccttccccgctcaaggagacctccttttccatcgggctgcaagtactgtttccattcctc ctggcaggctttgggaccgtggctgctggcatggtgttggacatcgtgcagcactgggaa gtcttccagaaggtgacagaggtcttcatcctagtgcctgcgctgctggggctcaaaggg aacctggaaatgaccctggcatcaaggctttccactgcagccaacattggacacatggac acacccaaggagctctggcggatgatcactgggaacatggccctcatccaggtgcaggcc acggtggtgggcttcctggcgtccatcgcagccgtcgtctttggctggatccctgatggc cacttcagtattccgcacgccttcctgctctgtgctagcagcgtggccacagccttcatt gcctccctggtactgggtatgatcatgattggagtcatcattggctctcgcaagattggg atcaacccagacaacgtggccacacccattgctgccagcctgggcgacctcatcaccttg gcgctgctctcaggcatcagctggggactctacctggaactgaatcactggcgatacatc tacccactggtgtgtgctttctttgtggccctgctgcctgtctgggtggtgctggcccga cgaagtccagccacaagggaggtgttgtactcgggctgggagcctgttatcattgccatg gccatcagcagtgtgggaggcctcatcttggacaagactgtctcagaccccaactttgct gggatggctgtcttcacgcctgtgattaatggtgttgggggcaatctggtggcagtgcag gccagccgcatctccaccttcctgcacatgaatggaatgcccggagagaactctgagcaa gctcctcgccgctgtcccagtccttgtaccaccttcttcagccctgatgtgaattctcgc tcagcccgggtcctcttcctcctcgtggtcccaggacacctggtgttcctctacaccatc agctgtatgcagggcgggcacaccaccctcacactcatcttcatcatcttctatatgaca gctgcactgctccaggtgctgattctcctgtacatcgcagactggatggtgcactggatg tggggccggggcctggacccggacaacttctccatcccatacttgactgctctgggggac ctgcttggcactgggctcctagcactcagcttccatgttctctggctcataggggaccga gacacggatgtcggggactag >gi568815597r:205618283_205829624|GENSCAN_predicted_peptide_8|41_aa XIHGVNEKISVQAYETQVKFIFELIQNADTDQEPVSHLHKL >gi568815597r:205618283_205829624|GENSCAN_predicted_CDS_8|126_bp nncatccatggagtcaacgagaaaatctcagtccaagcctatgagacccaagtgaaattc atctttgagttgattcagaatgctgacacagaccaggagccagtttctcacctgcacaaa ctgtga