GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:26:46 Sequence gi568815594r:98781031_99001983 : 220953 bp : 39.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 370 365 6 1.05 1.03 Term - 3795 3596 200 1 2 81 42 107 0.394 1.98 1.02 Intr - 5364 5297 68 2 2 91 68 68 0.484 2.73 1.01 Init - 8651 8551 101 2 2 82 95 58 0.970 5.70 1.00 Prom - 10887 10848 40 -5.15 2.04 PlyA - 12222 12217 6 1.05 2.03 Term - 20277 20215 63 0 0 99 54 99 0.182 4.51 2.02 Intr - 28205 28174 32 0 2 101 67 26 0.021 -1.27 2.01 Init - 32727 32556 172 0 1 70 15 157 0.270 6.25 2.00 Prom - 33858 33819 40 -6.95 3.00 Prom + 36125 36164 40 -5.65 3.01 Init + 39324 39385 62 2 2 90 91 84 0.976 9.67 3.02 Intr + 42654 42772 119 0 2 10 100 34 0.052 -4.01 3.03 Term + 57209 57435 227 0 2 78 47 152 0.293 6.06 3.04 PlyA + 57803 57808 6 1.05 4.16 PlyA - 57943 57938 6 1.05 4.15 Term - 61518 61299 220 2 1 80 39 103 0.003 0.13 4.14 Intr - 64444 64373 72 0 0 102 62 40 0.005 0.40 4.13 Intr - 67734 67641 94 1 1 57 57 86 0.019 0.50 4.12 Intr - 68153 68089 65 1 2 51 106 57 0.041 1.24 4.11 Intr - 90799 90524 276 0 0 96 116 55 0.614 4.81 4.10 Intr - 100112 100011 102 1 0 127 58 60 0.702 5.27 4.09 Intr - 104031 103892 140 0 2 43 110 159 0.999 11.94 4.08 Intr - 106162 106049 114 1 0 87 70 108 0.988 8.62 4.07 Intr - 106922 106859 64 1 1 76 91 -9 0.886 -4.00 4.06 Intr - 110302 110207 96 0 0 85 88 40 0.704 1.91 4.05 Intr - 120952 120846 107 1 2 86 115 102 0.972 10.79 4.04 Intr - 147613 147524 90 1 0 66 59 72 0.283 1.37 4.03 Intr - 148205 148065 141 2 0 51 117 68 0.572 5.63 4.02 Intr - 148894 148736 159 1 0 17 51 146 0.512 3.06 4.01 Init - 165549 165208 342 0 0 28 84 252 0.127 16.08 4.00 Prom - 166188 166149 40 -4.65 5.00 Prom + 171690 171729 40 -8.75 5.01 Sngl + 174970 175266 297 0 0 10 53 338 0.878 17.99 5.02 PlyA + 175601 175606 6 1.05 6.03 PlyA - 175788 175783 6 1.05 6.02 Term - 195879 195770 110 1 2 79 53 133 0.968 6.59 6.01 Init - 211761 211389 373 0 1 94 86 121 0.615 9.85 6.00 Prom - 211830 211791 40 -12.43 7.03 PlyA - 211910 211905 6 1.05 7.02 Term - 212489 212333 157 1 1 63 37 181 0.608 6.92 7.01 Init - 213001 212808 194 1 2 78 28 309 0.611 20.39 7.00 Prom - 213161 213122 40 -7.35 8.05 PlyA - 213173 213168 6 1.05 8.04 Term - 213765 213738 28 2 1 89 43 0 0.376 -7.83 8.03 Intr - 214377 214077 301 1 1 24 64 281 0.523 14.17 8.02 Intr - 215274 215063 212 2 2 34 10 207 0.417 5.13 8.01 Intr - 215447 215329 119 2 2 54 70 92 0.378 2.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 24952 25299 348 0 0 71 35 167 0.828 5.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_1|122_aa MTFASDQFTFALTELLPPLGSHRCDCTLSSGLYCTSIAPNNKCLSSSCLPAVQGTGATKS EVSYNVQRHRNVKHRVMLREILVSGASMNELRSSGRSGQKCWKGNNQCGFVYRKLMAGFD FS >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_1|369_bp atgacctttgcttctgaccagttcacttttgctctgactgaacttcttccacctcttggt agccatcgctgtgattgtactttgtcttcaggattgtactgcacatccattgcaccaaac aacaagtgtctgagctcctcgtgtcttccagctgttcagggcactggcgccactaaatct gaagtctcatacaacgttcaaaggcacaggaatgtgaaacatcgcgttatgctcagggaa atccttgtgagtggagcatcgatgaatgagttaagaagcagtgggaggtcgggccagaaa tgttggaaaggcaacaatcagtgtggcttcgtgtacaggaaactcatggctgggtttgat ttctcataa >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_2|88_aa MKEWTVPLSNRKVRKEKGEEEREKDGLQQGGEGEMLREARERPTRYSVAESKVQAAALSG LDGSTSTQGVGAAVAGDSSHRGRSRLIS >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_2|267_bp atgaaagagtggacagttcccctgagtaacagaaaagttagaaaagagaaaggagaggaa gagagagaaaaggatggcctgcagcagggtggggaaggtgaaatgctcagggaggccaga gaaagacccacccgttacagcgttgctgaatcaaaagttcaggcggctgctttgagtgga ttagatggttctacatcaactcagggagtcggagcagcagtggccggtgacagttcacat cgtggccgctccaggctcatcagttga >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_3|135_aa MEVTKNKEGLRNCHRREEPDRLITQLETSHPPCVCVVSPLQRLLPVILSKFGAYLKVSLA NYRNDVRQKAIFFVEFKMGRKAEETTCNINSASGPELLMNVQCSGGSRSLCNNNEPFLDW IVTCDKKWILYNNQQ >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_3|408_bp atggaggtcaccaaaaacaaggaaggtctaagaaactgtcatagacgagaggagcctgac aggctcatcacacaactggaaacttctcaccctccatgtgtctgtgtagtgtcacctctt cagaggcttctcccagtcattcttagtaaatttggtgcctatctgaaggtatctttagcc aactacagaaatgatgttagacaaaaagcaattttcttcgttgagttcaaaatgggtcgt aaagcagaggagacaacttgcaacatcaacagtgcatctggcccagaactgctaatgaac gtacagtgcagtggtggttccagaagtttatgcaacaacaatgaaccatttctcgattgg attgtgacatgtgacaaaaagtggattttatacaacaaccagcaatga >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_4|693_aa MSVHCVIEVNKKTSPLSRENIWLLEKAEACEIAWLHGTIQTHQEKPRVQNLVILEESIAL LGLQDKDIRCQVTTAELGLEKKVTRADKKAEGKSHHPSQTKRVPSQGEKEEKQKAISVHV KAEYWNGCTELPNNFYRHARFSPEVLISGFGTVKAVVPPPRPAGGKRVPPGPNGHIRHVA RSWPIRFESHFFPLTPPSGAVVRSDRSKMATVEPNPSSGARVRGARTSSLDRWTGELVGE CCRRETTPTPNPPTTEEEKTESNQEVANPEHYIKHPLQNRWALWFFKNDKSKTWQANLRL ISKFDTVEDFWALYNHIQLSSNLMPGCDYSLFKDGIEPMWEDEKNKRGGRWLITLNKQQR RSDLDRFWLETLLCLIGESFDDYSDDVCGAVVNVRAKGDKIAIWTTECENREAVTHIGRV YKERLGLPPKIVIGYQSHADTATKSGSTTKNRKQPRKPGFSQSGKYWNLCTVIIMIVGVL NYKNDSRWFLNHPIKLYSLCRTVLVFHALPKAACSVDLFVHCLIAGSKFKNEEKGECEKI LNQRDYTEETPNPKEIDYNTDWLTLEGISRADGTAGKDSFTTNKQLPELLIQCHCNGNRL TAYSCEPTSDLWLQLRRNKDGSHIQVTLMQDVSSLGLGQLCPCGFAGYSLPPSCFHGLVL NVCGFSMHTVQAVVGSTILGSGGRWPSSHSSTR >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_4|2082_bp atgagtgtacactgtgtcatagaagtaaacaagaaaaccagtcccttgagcagggagaat atatggctcttagaaaaagcagaagcatgtgaaattgcctggttacatggaacaatccag acacatcaagaaaagccaagagtacagaatctagttatattggaggaaagcattgctctc ctaggccttcaagacaaagatattcggtgtcaggtcacaacagcagagttaggactggag aaaaaagttacaagagctgacaaaaaggctgaaggcaagagtcatcaccccagccaaaca aaaagagtaccttctcaaggagaaaaagaggagaagcagaaggctatttccgtgcacgta aaagcggaatattggaacggttgcacagaacttccaaataatttttaccgccacgcaaga tttagccctgaggtcttaatctcaggatttgggacagtaaaagctgtcgtccctccccct cgtccagccggtggcaagcgggtgccgccagggccaaacggacatatccgtcacgtggcc agaagctggccaatccggtttgaatctcatttttttcctcttacccccccttctggagcg gttgtgcgatcagatcgatctaagatggcgactgtcgaaccgaaccccagctccggggct cgggtgcggggcgcgcgcacgtcttccttagaccggtggactggggagctggtgggcgag tgttgccggagggaaaccacccctactcctaatcccccgactacagaagaggagaaaacg gaatctaatcaggaggttgctaacccagaacactatattaaacatcccctacagaacaga tgggcactctggttttttaaaaatgataaaagcaaaacttggcaagcaaacctgcggctg atctccaagtttgatactgttgaagacttttgggctctgtacaaccatatccagttgtct agtaatttaatgcctggctgtgactactcactttttaaggatggtattgagcctatgtgg gaagatgagaaaaacaaacggggaggacgatggctaattacattgaacaaacagcagaga cgaagtgacctcgatcgcttttggctagagacacttctgtgccttattggagaatctttt gatgactacagtgatgatgtatgtggcgctgttgttaatgttagagctaaaggtgataag atagcaatatggactactgaatgtgaaaacagagaagctgttacacatatagggagggta tacaaggaaaggttaggacttcctccaaagatagtgattggttatcagtcccacgcagac acagctactaagagcggctccaccactaaaaatagaaagcagccaaggaaaccagggttc tctcaaagtggaaaatactggaacttatgtactgttatcataatgatagttggtgttttg aattataagaatgattccaggtggtttctaaatcatccaataaagctgtattcactctgt aggactgtcttagtatttcatgcattacctaaagcagcatgtagtgtggatttatttgtt cattgccttattgcaggaagtaaatttaaaaatgaagaaaagggagaatgtgaaaaaata ttaaaccaaagggactatactgaggagacccccaacccaaaggaaatagactacaacact gattggctgactttggagggaatctcaagagctgatgggactgctggaaaagattccttc actaccaacaagcagctgcctgaacttttgattcagtgtcactgtaatggcaatagactg actgcctattcctgtgaaccgacaagtgatctctggctgcagctcagaagaaacaaggac gggtctcacatccaggtcacgctgatgcaagatgtgagttcccttggtcttgggcagctc tgcccctgtggctttgcagggtacagcctccctcccagctgctttcacgggctggtattg aatgtctgtggcttttccatgcacacagtgcaagctgttgttggatctaccattctgggg tctggaggacgatggccctcttctcacagctccactaggtag >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_5|98_aa MSDKRGFENVELGVIQGKKKVPRRAILFVSDDTMKEYSTDEDEVDGLERKDVLPTLDPEE EGGGGGGVKEEEEKNRMSTEAERHYQQRRRRKRRRMSV >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_5|297_bp atgagtgacaaaagaggctttgaaaatgtagaactgggagtgatacaaggaaagaagaaa gtcccaaggagagccatcctctttgttagtgatgacacaatgaaggaatacagcacagat gaagatgaagttgatggcctagagaggaaagatgttttgcctactcttgacccagaagaa gaaggaggtggaggaggtggagtgaaggaagaggaggagaaaaataggatgtctacagaa gcagaaagacattaccaacagaggaggaggaggaagaggaggagaatgtctgtatga >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_6|160_aa MKVCSFIPEAIETTNPPGRTSNSRSAALRAVTLTVKVCSFTPEPARPGTHQKEETLNTSE HQKEQTPGTMPLRTVTLTVRVHGFILEVSETKNPPIPDTTAHCSLNILSSSDVPASASGL VRTTGNNQTCEYHPEAAEQASKGRVIYLSTSPIRQGDTRP >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_6|483_bp atgaaggtctgcagcttcattcctgaagccatcgagaccaccaacccacccgggagaaca agcaactccagaagtgccgccttaagagctgtaacactcaccgtgaaggtctgcagcttc actcctgagccagcgagaccaggaacccaccagaaggaagaaactctgaacacatctgaa catcagaaggaacaaactccaggcacgatgcctttaagaactgtaacactcacggtgagg gtccatggcttcattcttgaagtcagtgagaccaagaacccaccaattccggacacaaca gctcactgcagcctcaacatcctgagctcaagtgatgttcctgcctcagcctctggacta gttaggactacaggaaacaatcagacgtgtgagtaccatccagaagcagcagaacaagct agtaaaggaagggtaatatacctctccacgtcaccaattcggcaaggggatactaggccc tga >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_7|116_aa MLAALAALACSRCLLASAPILAVLEEPFSRRCTVGALLWAGRGQRRFPQLVVGVVAEARV ATAAAAPGPMDHPRAQECGLTARDWQTAPPAAPVREPLGVASWAPESSGDLENLYV >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_7|351_bp atgctggcagccctcgcagcccttgcttgctctcggtgcctcctggcctcagctcccatt ctggccgtgcttgaggagcccttcagccgccgctgcaccgtgggagcccttctctgggct ggccgaggccagagacggttccctcagcttgtggtaggtgtggtggcagaggcacgggtg gcaactgcggctgcggcacccggtcccatggaccacccaagggctcaggagtgtgggctt acagctcgggactggcagacagctccacctgcggccccagtgcgggaaccactgggtgta gccagctgggctcctgagtctagtggggacttggagaacctttatgtctag >gi568815594r:98781031_99001983|GENSCAN_predicted_peptide_8|219_aa LLKCQSTVVKHKEVGLELASVPPLCDSDGPRATHLSEPGSAQTRAPPVLRMWRLATLGLP SFPAPLPASGSPHTWRPPPQRGEPAPERGGLARPRDPAQPASAPAHMAAAAENATCRRLT VRSRPLAAHSLCLQSDKTVVFTDEHPGALIIPDTRRRLERALRSSALDSPAAPWRTVPWV KFCFSLTLSGLRPKDHRSFTERRAIELSTGREERKTNIY >gi568815594r:98781031_99001983|GENSCAN_predicted_CDS_8|660_bp ttgcttaagtgtcaaagcacggtagtaaagcacaaagaagtggggctggagctagctagc gttcctcctctctgcgactccgacggtcctcgggccacccatctctcagagcccggcagc gcccagacccgagccccgccggtgctgcggatgtggaggctcgcgacgctcgggctgccc tccttcccggcgccgctacctgcctccggcagcccgcacacctggcggcccccgccccag cgcggggaaccggccccagaacgcggaggtctagcgcggccacgcgaccccgcacagccc gcttccgccccagcccacatggccgcggcggcagagaacgcgacctgccggcgcctgacg gtgaggagccgccctctcgcggcgcactctctgtgcctacagtcagataagacagtcgtc ttcactgacgaacacccaggagccctgatcatccccgacacccgcaggaggctagagagg gcgctgcgctcctctgcgttggattcccccgccgctccttggagaacagtcccttgggtt aagttctgtttctcactgacattgtcgggtctgaggcctaaggaccaccgctcattcaca gaaaggagagccatagagctgtcaacgggacgggaagagcgaaaaactaacatttactga