GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:17:33 Sequence gi568815576r:38766212_38972190 : 205979 bp : 50.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1944 2040 97 2 1 121 64 141 0.377 14.58 1.02 Term + 3885 4141 257 1 2 21 53 163 0.418 1.85 1.03 PlyA + 5390 5395 6 1.05 2.09 PlyA - 6240 6235 6 1.05 2.08 Term - 13402 13238 165 1 0 109 47 296 0.788 25.52 2.07 Intr - 14331 14243 89 1 2 25 56 63 0.461 -3.51 2.06 Intr - 14798 14669 130 2 1 87 26 207 0.619 14.67 2.05 Intr - 16574 16452 123 2 0 48 98 81 0.512 5.88 2.04 Intr - 18044 18013 32 0 2 130 93 -2 0.490 2.35 2.03 Intr - 21915 21620 296 2 2 97 36 62 0.058 -1.45 2.02 Intr - 27349 27290 60 0 0 75 59 75 0.146 1.15 2.01 Init - 32271 32231 41 0 2 83 81 54 0.244 3.89 2.00 Prom - 46981 46942 40 -2.76 3.19 PlyA - 47094 47089 6 1.05 3.18 Term - 56622 56398 225 0 0 78 55 189 0.657 11.38 3.17 Intr - 57051 56872 180 0 0 47 110 221 0.855 20.26 3.16 Intr - 60536 60289 248 0 2 100 102 474 0.998 47.18 3.15 Intr - 62331 62076 256 0 1 84 76 315 0.736 26.92 3.14 Intr - 77815 77024 792 1 0 41 83 1355 0.671 121.97 3.13 Intr - 81915 81773 143 1 2 38 94 85 0.798 4.17 3.12 Intr - 85625 85491 135 0 0 55 83 65 0.734 3.24 3.11 Intr - 92573 92408 166 2 1 87 42 73 0.025 2.13 3.10 Intr - 94547 94441 107 0 2 95 20 70 0.645 0.83 3.09 Intr - 94688 94621 68 1 2 81 74 68 0.761 3.25 3.08 Intr - 97466 97338 129 1 0 52 96 46 0.235 1.61 3.07 Intr - 100990 100002 989 1 2 130 61 1250 0.289 116.17 3.06 Intr - 105335 105269 67 1 1 113 86 55 0.999 6.71 3.05 Intr - 105546 105481 66 2 0 82 87 116 0.997 8.92 3.04 Intr - 105734 105691 44 2 2 135 89 35 0.753 5.34 3.03 Intr - 106180 105911 270 1 0 34 105 199 0.501 13.94 3.02 Intr - 118021 117912 110 2 2 46 100 76 0.534 4.60 3.01 Init - 140948 140939 10 2 1 107 81 2 0.438 2.17 3.00 Prom - 143161 143122 40 -2.06 4.00 Prom + 150264 150303 40 -0.66 4.01 Init + 151888 151941 54 0 0 82 113 58 0.782 6.99 4.02 Intr + 156386 156510 125 1 2 51 100 23 0.207 -0.82 4.03 Intr + 170526 170630 105 0 0 103 75 36 0.000 3.13 4.04 Term + 180637 181057 421 1 1 31 47 201 0.000 5.06 4.05 PlyA + 182966 182971 6 1.05 5.00 Prom + 185030 185069 40 -3.76 5.01 Init + 186598 186639 42 0 0 83 110 51 0.820 7.22 5.02 Intr + 190922 191086 165 1 0 65 37 90 0.461 1.76 5.03 Intr + 191152 191213 62 0 2 98 29 55 0.450 -1.87 5.04 Intr + 193385 193534 150 2 0 -25 43 255 0.000 8.98 5.05 Intr + 194018 194042 25 2 1 65 95 -29 0.286 -6.47 5.06 Intr + 195176 195470 295 1 1 123 95 247 0.597 25.48 5.07 Intr + 195887 196002 116 0 2 127 94 63 0.998 10.97 5.08 Intr + 196217 196442 226 1 1 119 15 55 0.192 -1.14 5.09 Intr + 200056 200108 53 2 2 94 111 33 0.238 4.83 5.10 Intr + 205299 205394 96 2 0 47 115 46 0.046 3.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 68963 69020 58 1 1 86 77 90 0.828 7.25 S.002 Term - 92573 92322 252 2 0 87 54 111 0.888 3.24 S.003 Sngl + 180641 181057 417 1 0 49 47 209 0.865 9.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:38766212_38972190|GENSCAN_predicted_peptide_1|117_aa GPCETGNTERQHGVSSEWEEEELKPAGLQVQRALGIEASTPLGSLALPAPGNSVVMLAES AVKAPAGRELLLVAGMGRIWPFQGPAANTAFNKMDGEEGKRREPGTEKPAETLSWSP >gi568815576r:38766212_38972190|GENSCAN_predicted_CDS_1|354_bp gggccctgtgagaccggcaatactgagcggcaacatggggtgagctcggagtgggaagag gaggagctgaagcccgcaggcctgcaggtgcagcgagccctgggcatcgaggcctccact cctctgggctccctggcacttccggcccctgggaactcagtggtcatgttggccgagtct gcagtcaaggcccctgcgggaagagagctgctgctggtggcagggatgggccgtatctgg ccattccagggtccagcggcaaatacagcattcaacaaaatggatggagaggaagggaaa agaagggagcccgggacagagaaaccggctgagaccctgtcctggagcccctga >gi568815576r:38766212_38972190|GENSCAN_predicted_peptide_2|311_aa MEGEKKLSDGGMRRMSLFFDPTLGFREDVLPNGRRPRKTGVPACVPNVAGSKRQAWTEAG WVDAEYRSHLCYWLVSYLLVPKEKAQKVESEGWIGVPILVLPLPAVHPQATHKTSLSSSD LSENRELDPSLTEPNSRGCCDDKSQHLLKAPLSSSGHSGRIMGETEGKKDEADYKRLQTF PLVRHSDMPEEMRVETMELCVTACEKFSNNNEVLPAVQAAPRAFDGPCVGDTASRHQRRK AIHQSGGESEKSVLRQPSAAKMIKETMDKKFGSSWHVVIGEGFGFEITHEVKNLLYLYFG GTLAVCVWKCS >gi568815576r:38766212_38972190|GENSCAN_predicted_CDS_2|936_bp atggaaggagagaagaagttatcggacggaggaatgcggcggatgtcgttgttctttgac ccaactttaggcttccgagaggatgtgctccccaatgggagaaggccaaggaagactgga gtgcccgcctgtgtccccaatgtggcgggaagcaagaggcaggcctggacagaggctgga tgggtggatgctgagtatcgaagtcatctgtgttactggcttgtctcctacttgcttgtg ccaaaggagaaagcacagaaggtcgagtcagaaggatggattggggttccgatcctggtt ctaccattgccggctgtgcaccctcaggccacccataaaacctctctgagctccagtgac ttgtctgagaatagggagctggatccttccctcacagaaccgaactcccggggttgttgc gatgacaagtcccagcacctcctgaaagccccactctcctccagtggtcacagtggaagg atcatgggagaaacagaagggaagaaagatgaggctgattataagcgactgcagaccttc cctctggtcaggcactcggacatgccagaggagatgcgcgtggagaccatggagctatgt gtcacagcctgtgagaaattctccaacaacaacgaggtattgccagcagtgcaggcggcc cctcgtgcttttgatggcccctgtgttggggatacggccagcaggcatcaacgcaggaag gccattcatcaatctggtggggagagcgagaagtcagtgttgcggcagccgagcgccgcc aagatgatcaaagagacaatggacaagaagttcggctcctcctggcacgtggtgatcggc gagggctttgggtttgagatcacccacgaggtgaagaacctcctctacctgtacttcggg ggcaccctggctgtgtgcgtctggaagtgctcctga >gi568815576r:38766212_38972190|GENSCAN_predicted_peptide_3|1334_aa MGEGTGTSQPISYQQAFDIVNPLKGPPQDPREGDEEEKAKQPPRGRGRSWQPRASREDHV TARSWNASARPAALPPAGAWALRRVRERCRTGRGRREYYGLWVPLSKMELSAVGERVFAA ESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQKERERELYGPKKRG PKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLHSSAAVHRLKKDIRRCHRMSRRP LPRPDPQGGSPGLRPPISPFSETVRIINRKVKPREPKRNRIILNLKVIDKGAGGGGAGQG AGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKFGAFALYKPPPAPLVAPSPGKAEAS APGPGLLLAAPAAPYDARSSGSSGCPSPTPQSSDPDDTPPKLLPETVSPSAPSWREPEVL DLSLPPESAATSKRAPPEVTAAAGPAPPTAPEPAGASSEPEAGDWRPEMSPCSNVVVTDV TSNLLTVTIKEFCNPEDFEKVAAGVAGAAGGGGSIGASKWSASAGLSSKWSSPGQPWCMK RHQAHRLEVGSAHGGPSGWDRGCLAFHLQAPDDSRGHLFARPRTAWGPERFCFGSGSGLV HQSPSWPAPAPCVFHTSAIGAAGFWEDRLQKGLVACRGRAQGHLIHSGGVTKEAAGGGPV LASRHPHALSISQGSWSSAERYTSQALSCTLAFHSCINPPVVQPRKPERVAESFAPGDSI RREATGTLEGLLLTVYGELDLGVRNGPWQPGRGPLSSPFNNNFATQQQSGGSSGRGDSSS SGSGSGSGSGSRACPARPSAPGLRAPTPPPRLPGASGAPAARLTLKFLAVLLAAGMLAFL GAVICIIASVPLAASPARALPGGADNASVASGAAASPGPQRSLSALHGAGGSAGPPALPG APAASAHPLPPGPLFSRFLCTPLAAACPSGAQQGDAAGAAPGEREELLLLQSTAEQLRQT ALQQEARIRADQDTIRELTGKLGRCESGLPRGLQGAGPRRDTMADGPWDSPALILELEDA VRALRDRIDRLELISVSLPCEQQELPARVNLSAAPAPVSAVPTGLHSKMDQLEGQLLAQV LALEKERVALSHSSRRQRQEVEKELDVLQGRVAELEHGSSAYSPPDAFKISIPIRNNYMY ARVRKALPELYAFTACMWLRSRSSGTGQGTPFSYSVPGQANEIVLLEAGHEPMELLINDK VAQLPLSLKDNGWHHICIAWTTRDGLWSAYQDGELQGSGENLAAWHPIKPHGILILGQEQ DTLGGRFDATQAFVGDIAQFNLWDHALTPAQVLGIANCTAPLLGNVLPWEDKLVEAFGGA TKAAFDVCKGRAKA >gi568815576r:38766212_38972190|GENSCAN_predicted_CDS_3|4005_bp atgggggaaggaacaggaacctcacagccgatctcctatcagcaagcttttgatattgtc aatccccttaaggggccccctcaagaccccagggaaggggatgaggaggaaaaggcaaag caaccgccccgggggagggggaggtcctggcaaccgcgcgccagccgcgaggatcacgtg acggcccgcagctggaacgcgagcgcgcgccccgccgcgctcccgcccgccggggcctgg gcgctgcggcgcgtgcgcgagcggtgccgcaccggccgcgggcgcagggagtattatggg ctgtgggtgccgctgagcaagatggagctgtctgcagtgggcgagcgggtcttcgcggcc gaatccatcatcaaacggcggatccgaaagggacgcatcgagtacctggtgaaatggaag gggtgggcgatcaagtacagcacttgggagcccgaggagaacatcctggactcgcggctc attgcagccttcgaacaaaaggagagggagcgtgagctgtatgggcccaagaagagggga cccaaacccaaaactttcctcctgaaggcgcgggcccaggccgaggccctccgcatcagt gatgtgcatttctctgtcaagccgagcgccagtgcctcctcgcccaagctgcactccagc gcagccgtgcaccggctcaagaaggacatccgccgctgccaccgtatgtcccgccgtccc ctgccccgcccggacccgcaggggggcagccccggactgcgcccgcccatttcgcccttc tcggagacggtgcgcatcatcaaccgcaaggtgaagccgcgggagcccaagcggaaccgc atcatcctgaacctgaaggtgatcgacaagggcgctggcggcgggggcgccgggcagggg gccggggcgctggcccgccccaaagtcccctcgcggaaccgcgttataggcaagagcaag aagttcagcgagagcgtcctgcgtacacagatccgccacatgaagttcggcgcctttgcg ctgtacaagcctccgcccgcccccctggtagccccgtcccccggcaaggctgaggcctca gccccgggccctgggctacttctggccgcccccgccgccccctacgacgcccgcagctct ggctcctccggctgcccctcgcctacaccacagtcctctgaccccgacgacacgcccccc aagctcctccccgagaccgtgagcccatccgcccccagctggcgcgagccggaggtgctc gacctgtccctccctcccgagtcggcagccaccagcaagcgggcaccgcctgaggtcaca gctgctgccggcccggcacctcccacggcccctgagcccgccggtgcctcctccgagccc gaggctggggactggcgccccgagatgtcaccctgctccaatgtggtcgtcaccgatgtc accagcaacctcctgacggtcacaatcaaggaattctgcaaccctgaggatttcgagaag gtggctgctggggtagcaggcgccgctgggggcggtggcagcattggggcgagcaagtgg tctgcttctgctggcctgagctccaagtggagcagcccgggccagccttggtgcatgaag aggcaccaggcacaccgccttgaggtgggcagtgcccatgggggcccgagtggatgggac cgagggtgcctggcgttccacctgcaagctcccgatgactcccgtggccacctgtttgcc aggcctcggacagcctggggtcctgagcgcttctgctttggcagcggctctggcctggtc caccagtctccatcctggcctgcacctgccccctgtgtcttccacaccagtgccataggg gctgctggcttctgggaggacagacttcaaaaaggcctggtggcctgcagaggccgagca caaggacacctgatccacagcggtggggtcacaaaggaagcagcagggggaggccccgtc ctggcctccagacatccacacgcgctgagcatcagccagggaagctggagctctgctgag cgctacacgagccaggctctgagctgcacactcgcatttcactcttgcataaatcctccc gtggtgcagccaaggaagccggagagagtggctgagtcatttgctccaggtgactctatt agaagagaggccacgggcaccttggagggactactcctgaccgtgtatggagaactggac cttggtgtaaggaatggaccatggcagccagggaggggcccccttagctctccattcaac aacaactttgccactcagcagcagagcggcggcagcagcggccgcggcgacagctccagc tccggctccggctccggctccggctccggctcccgcgcctgccccgctcggcccagcgcg cccgggctccgcgccccgaccccgccgccgcgcctgccgggggcctcgggcgcccccgcc gcccgcctcacgctgaagttcctggccgtgctgctggccgcgggcatgctggcgttcctc ggtgccgtcatctgcatcatcgccagcgtgcccctggcggccagcccggcgcgggcgctg cccggcggcgccgacaatgcttcggtcgcctcgggcgccgccgcgtccccgggcccgcag cggagcctgagcgcgctgcacggcgcgggcggttcagccgggccccccgcgctgcccggg gcacccgcggccagcgcgcacccgctgccgcccgggcccctgttcagccgcttcctgtgc acgccgctggctgctgcctgcccgtcgggggcccagcagggggacgcggcgggcgctgcg ccgggcgagcgcgaagagctgctgctgctgcagagcacggccgagcagctgcgccagacg gcgctgcagcaggaggcgcgcatccgcgccgaccaggacaccatccgtgagctcaccggc aagctgggccgctgcgagagcggcctgccgcgcggcctccagggcgccgggccccgccgc gacaccatggccgacgggccctgggactcgcctgcgctcattctggagctggaggacgcc gtgcgcgccctgcgggaccgcatcgaccgcctggagttgatttctgtttctctgccctgt gaacagcaggagcttccagcccgtgtgaacctctcagctgccccagccccagtctctgct gtgcccaccggcctacactccaagatggaccagctggaggggcagctgctggcccaggtg ctggcactggagaaggagcgtgtggccctcagccacagcagccgccggcagaggcaggaa gtggaaaaggagttggacgtcctgcagggtcgtgtggctgagctggagcacgggtcctca gcctacagtcctccagatgccttcaagatcagcatccccatccgtaacaactacatgtac gcccgcgtgcggaaggctctgcccgagctctacgcattcaccgcctgcatgtggctgcgg tccaggtccagcggcaccggccagggcacccccttctcctactcagtgcccgggcaggcc aacgagattgtactgctagaggcgggccatgagcccatggagctgctgatcaacgacaag gtggcccagctgcccctgagcctgaaggacaatggctggcaccacatctgcatcgcctgg accacaagggatggcctatggtctgcctaccaggacggggagctgcagggctccggtgag aacctggctgcctggcaccccatcaagcctcatgggatccttatcttgggccaggagcag gataccctgggtggccggtttgatgccacccaggcctttgtcggtgacattgcccagttt aacctgtgggaccacgccctgacaccagcccaggtcctgggcattgccaactgcactgcg ccactgctgggcaacgtccttccctgggaagacaagttggtggaggcctttgggggtgca acaaaggctgccttcgatgtctgcaaggggagggccaaggcatga >gi568815576r:38766212_38972190|GENSCAN_predicted_peptide_4|234_aa MGWVWWLMAVIPALLEAKASWLVQEWMPDAGCTNQILSSGDLESRTSGYHRLALAVRPGR KLSLASGAAYICRQRTLLPHGWACRFCGLVTLTCTEMRALGTTGTEPPGPGESKEGFLEE EWQSGPLLLQQECPQLLLGKGLCWLGLPLGQSGWLLPLPSEGSSGESCHARTPFPIIPNP LHWFLPADVKIKSWPDFPLSFRFYPCALRPPRSPLRLLICDLEEIARVLGGMPA >gi568815576r:38766212_38972190|GENSCAN_predicted_CDS_4|705_bp atgggctgggtgtggtggctcatggctgtaatcccagcccttttggaggccaaggcttcc tggttggtccaggagtggatgcctgatgcaggctgcacaaatcagattctctcttctgga gatttagaatcgagaaccagtggttatcataggttagccttggctgtcagacctggaagg aagctctctttggcctctggagctgcgtatatctgtagacagcggaccctgctgccacat ggctgggcctgccgcttctgtggcttggtcacactcacatgcacggaaatgagagcactg gggactactggaacagaaccgcctggcccgggagaatccaaggaaggcttcctggaggag gaatggcagagcggacctctgctactgcagcaggaatgcccccagctgctgctggggaag ggcttatgttggctgggtttgcctttggggcagagtggctggctgctccccctgcccagc gagggtagttctggggagagttgccacgctcggaccccatttcccatcatccccaaccct ttgcattggttcttgccagcagatgtgaagataaaaagctggccagacttcccactctcc ttccgcttctatccctgtgcgctgaggcctccgaggagccctctgaggcttctgatttgc gacctggaggaaattgcccgtgtgctcggaggaatgcctgcatga >gi568815576r:38766212_38972190|GENSCAN_predicted_peptide_5|410_aa MESSLDKRHTVERQPGKKMQDPNREHRMQQVPGSLDQAHPALATGEDTGVAVLEPALWTA ECYSGSFSRGAKCGQLYIPIVTPQCLVVQWHKTYLCYEVERLDNGTSVKMDQHRGFLHNQ VTDPAIRIRAGPFQSRDIHSTASVWRDQAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIY RVTWFISWSPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYDPLYKEALQMLRDAGAQV SIMTYDEFKHCWDTFVDHQGCPFQPWDGLDEHSQALSGRLRAILQATSLCPLSTLSPPAP FNPPALPESGKLKDGPQSLRKAETWVEQQNKRSSSKKCKQTVHHHLQLLTDASKAVCSRS KSSVRAVVATGDPGTFAPLELPHRMASGFQNTKSGQDSAYITLANALLAK >gi568815576r:38766212_38972190|GENSCAN_predicted_CDS_5|1230_bp atggaatcttccctggacaagcgacataccgtggagagacagccagggaagaagatgcag gaccctaacagagagcacaggatgcagcaggtgccagggagcctggaccaggcacatcct gcactggccacaggggaggacacaggggtggctgtcctggagcctgctctctggaccgct gagtgttattcagggtctttctccaggggagcaaagtgtggccagctctacatccccatt gtcactccacagtgtctggtggttcagtggcataagacctacctgtgctacgaagtggag cgcctggacaatggcacctcggtcaagatggaccagcacaggggctttctacacaaccag gtgaccgacccagccatccgaatccgggcagggcccttccaatccagggacattcatagc acagcctctgtctggagagaccaggctaagaatcttctctgtggcttttacggccgccat gcggagctgcgcttcttggacctggttccttctttgcagttggacccggcccagatctac agggtcacttggttcatctcctggagcccctgcttctcctggggctgtgccggggaagtg cgtgcgttccttcaggagaacacacacgtgagactgcgtatcttcgctgcccgcatctat gattacgaccccctatataaggaggcactgcaaatgctgcgggatgctggggcccaagtc tccatcatgacctacgatgaatttaagcactgctgggacacctttgtggaccaccaggga tgtcccttccagccctgggatggactagatgagcacagccaagccctgagtgggaggctg cgggccattctccaggccacctccctgtgccctctttccactctctcacctcctgctcca ttcaacccccctgctcttccagaatcagggaaactgaaggatgggcctcagtctctaagg aaggcagagacctgggttgagcagcagaataaaagatcttcttccaagaaatgcaaacag accgttcaccaccatctccagctgctcacagacgccagcaaagcagtatgctcccgatca aaaagctcagtgagggctgtcgtggccactggcgacccaggcacatttgctccgcttgag ctccctcaccgaatggcatctgggttccaaaataccaagtctggtcaagactctgcctac atcacacttgctaatgccctgttggccaag