GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:36:11 Sequence gi568815597r:235350209_235604252 : 254044 bp : 44.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12212 12314 103 0 1 83 36 61 0.012 -0.37 1.02 Intr + 17171 17296 126 2 0 84 99 45 0.080 4.99 1.03 Intr + 29811 29941 131 0 2 83 115 115 0.435 14.04 1.04 Term + 30016 30029 14 2 2 77 43 17 0.600 -5.54 1.05 PlyA + 32058 32063 6 1.05 2.04 PlyA - 32855 32850 6 1.05 2.03 Term - 34434 33809 626 1 2 46 42 287 0.716 14.45 2.02 Intr - 35064 34771 294 1 0 39 20 166 0.053 1.88 2.01 Init - 36711 36411 301 0 1 88 -8 241 0.133 11.91 2.00 Prom - 37438 37399 40 -7.76 3.00 Prom + 46287 46326 40 -3.76 3.01 Init + 58996 59036 41 0 2 85 62 39 0.704 0.66 3.02 Intr + 64225 64410 186 1 0 93 115 160 0.978 18.00 3.03 Term + 69265 69424 160 1 1 48 32 97 0.235 -2.49 3.04 PlyA + 75023 75028 6 1.05 4.00 Prom + 80837 80876 40 -4.86 4.01 Init + 82777 82963 187 0 1 85 92 284 0.967 27.72 4.02 Intr + 83851 84003 153 2 0 68 66 56 0.509 1.44 4.03 Intr + 86336 86400 65 0 2 101 71 51 0.702 3.14 4.04 Intr + 87114 87266 153 2 0 75 83 181 0.775 16.57 4.05 Term + 91606 91659 54 0 0 88 48 39 0.406 -2.54 4.06 PlyA + 93636 93641 6 1.05 5.15 PlyA - 94027 94022 6 1.05 5.14 Term - 95876 95810 67 1 1 46 49 84 0.229 -2.29 5.13 Intr - 100132 100032 101 1 2 80 62 74 0.290 2.91 5.12 Intr - 102938 102882 57 2 0 24 111 70 0.774 1.98 5.11 Intr - 104101 103948 154 0 1 36 91 112 0.703 6.37 5.10 Intr - 105476 105351 126 1 0 104 106 44 0.939 7.59 5.09 Intr - 108578 108395 184 0 1 72 78 130 0.790 9.25 5.08 Intr - 115506 115428 79 0 1 97 110 -16 0.590 0.62 5.07 Intr - 120752 120642 111 2 0 48 98 53 0.650 2.88 5.06 Intr - 129941 129846 96 2 0 64 77 131 0.895 9.81 5.05 Intr - 134307 134114 194 1 2 47 69 251 0.991 18.31 5.04 Intr - 139060 138960 101 0 2 40 111 55 0.861 2.75 5.03 Intr - 144620 144473 148 0 1 112 90 57 0.897 7.59 5.02 Intr - 153653 153480 174 0 0 73 18 87 0.459 0.11 5.01 Init - 154044 153933 112 0 1 26 94 265 0.986 19.28 5.00 Prom - 167919 167880 40 -5.46 6.00 Prom + 174318 174357 40 -3.96 6.01 Init + 181214 181234 21 1 0 99 50 17 0.525 -2.27 6.02 Term + 183728 183796 69 1 0 119 39 111 0.919 7.24 6.03 PlyA + 184170 184175 6 1.05 7.06 PlyA - 184182 184177 6 1.05 7.05 Term - 188609 188419 191 0 2 41 52 137 0.411 3.01 7.04 Intr - 194242 194211 32 0 2 116 74 16 0.621 0.77 7.03 Intr - 200052 199876 177 2 0 58 75 171 0.880 11.83 7.02 Intr - 202029 201923 107 0 2 144 -57 115 0.360 1.61 7.01 Init - 202707 202705 3 0 0 71 101 0 0.249 -0.40 7.00 Prom - 206947 206908 40 -1.96 8.00 Prom + 213871 213910 40 -1.86 8.01 Init + 216475 216528 54 0 0 84 59 75 0.328 5.48 8.02 Intr + 225024 225092 69 2 0 62 97 32 0.065 0.88 8.03 Intr + 241398 241609 212 2 2 47 74 133 0.348 5.51 8.04 Intr + 243567 243615 49 0 1 66 89 52 0.322 1.78 8.05 Intr + 244742 244793 52 1 1 76 81 45 0.314 1.08 8.06 Term + 244947 244966 20 1 2 122 50 -9 0.275 -2.82 8.07 PlyA + 246374 246379 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_1|124_aa XLPLSTRQIKQELAEEYETTKSPVPPAYSLQLLLSSRTSLRWRRDFLQHFRPEPQASLLG SWLEGLLLGTPGVSAGRSHILDSGYIIMSDTLTADVIGRRVEVNGEHATVRFAGVVPPVA AVSV >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_1|375_bp nggttacccctaagcaccagacagatcaaacaggagcttgctgaggagtatgagaccact aagagtccagtgcccccagcctacagcctccaactcctcctaagcagccgtacctcactc cggtggaggcgggacttcctacagcacttccggccagagcctcaagcttcgctgctgggc agttggctggaggggctgctgctgggaacacctggagtctccgcgggcagatctcatatt ttggattctggatatattataatgagtgacactttgacagcggatgtcattggtcgaaga gttgaagttaatggagaacatgcaacagtacgttttgctggtgttgtccctcccgtggca gctgtttctgtgtaa >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_2|406_aa MGKKQSRKTGNSKKQSASPPPKEHSSSPATEQSWTENDFDKLREEGFRRSNYSELQEEIQ TKGKEVKNFEKNLDECITRITNTEKCLKELMELKAKAQELPPHHTYSKIDHIVGSKALLS KCKRSDIITNCLSDHSAIKLEFRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKTEIKMFFE TNENKDTTYQNLWDNIQSKIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVE SLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEQVPFLLKLFQSIEKEGILPN SFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFI PGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPSC >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_2|1221_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacacagctcctcaccagcaacggaacaaagctggacagagaatgactttgac aaattgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaa accaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcaagaacta ccaccacaccacacctattccaaaattgaccacatagttggaagtaaagctctcctcagc aaatgtaaaagatcagacattataacaaactgtctctcagaccacagtgcaatcaaacta gaattcaggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctg ctcctgaatgactactgggtacataatgaaatgaagacagaaataaagatgttctttgaa accaacgagaacaaagacacaacataccagaatctctgggacaacattcaaagcaaaata caaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaatctagaa gaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaa tctctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaacc aaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaa caggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaac tcattttatgaggccagcatcatcctgataccaaagccaggcagagacacaaccaaaaaa gagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactg gcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatc cctgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcatataaac agaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaa attcaacaaccttcatgctaa >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_3|128_aa MGQDDDFTSSMCNWHPTGGSFIRPNKVNFGTDFLTAIKNRYVLEDGPEEDRKEQIVTIGN KPVETIGFDSIMKQQSQLSKLQEVSLRNCAVSCAGEKGGVAEACPSILFTESLLLESDYG FYTCARDC >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_3|387_bp atgggtcaggatgatgattttaccagcagcatgtgtaattggcacccgacaggaggatcc tttattcgtccgaacaaggtaaattttggaacagactttcttactgcaattaagaaccgc tatgtgttagaagatggaccagaggaagatagaaaagagcaaattgttacaattggaaat aaacctgtggagactatcggttttgactctattatgaaacagcaaagtcagctgagcaag ttgcaagaagtttctctgaggaactgtgcagtaagttgtgctggtgaaaaaggaggagtt gctgaagcatgtcctagtatccttttcaccgagagcttgttattggaatctgactatgga ttttatacctgtgccagagactgctaa >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_4|203_aa MQKDASKFVDLCVLQKCSTSNCIISAKDHTSMRMNVAKASEVTGRFNSQFKTCAICRTVC RMGLKQSYELSVRHLFAETVTELNSCGGSYEHPGRNSQQRPPEPEPSFSSRCCGCKTSMF PSLKYLVVNDNQISQWSFFNELEKLPSLRALSCLRNPLTKEDKEAETARLLIIASIGQLK TLNKCENMVHLKIGNSKHSNHLC >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_4|612_bp atgcagaaagacgccagcaagttcgtggatctgtgcgtgctgcagaaatgctccaccagc aactgcatcatcagtgccaaggaccacacatccatgcggatgaacgtggccaaggccagt gaggtcacgggcaggtttaacagccagtttaaaacctgtgctatctgcaggactgtttgc aggatgggcctgaaacagtcttatgaactgtccgtgaggcacttgtttgctgaaacagta actgagctgaattcttgtggtggttcctatgaacacccgggaagaaacagccagcagagg ccgcctgagcctgaaccgagtttctcttccaggtgctgcgggtgcaaaacgtccatgttc ccatccttgaagtacctggtagtaaacgacaatcagatatcacaatggtcgtttttcaat gagctagagaagttaccaagtctacgggctttgtcctgcctaagaaaccccctgaccaaa gaggacaaagaagcagagacggcgcgactactcattatcgccagcattggccagctgaag acgctgaacaaatgtgagaatatggtgcacctgaagattgggaactcaaaacacagcaac cacttatgctga >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_5|567_aa MRNWLVLLCPCVLGAALHLWLRLRSPPPACASGAGPAGLVSDVRRRPFARERNLVCPSAA SAFCFGSSGSEVVFFPPKLLVFLGQDWGFSAVDNVYQLALFPQWKSTHYDVVVGVLSARN NHELRNVIRSTWMRHLLQHPTLSQRVLVKFIIGAHGCEVPVEDREDPYSCKLLNITNPVL NQEIEAFSLSEDTSSGLPEDRVVSVSFRVLYPIVITSLGVFYDANDVGFQRNITVKLYQA EQEEALFIARFSPPSCGVQVNKLWYKPVEQFILPESFEGTIVWESQDLHGLVSRNLHKVT VNDGGGVLRVITAGEGALPHEFLEGVEGVAGGFIYTIQEGDALLHNLHSRPQRLIDHIRN LHEEDALLKEESSIYDDIVFVDVVDTYRNVPAKLLNFYRWTVETTSFNLLLKTDDDCYID LEAVFNRIVQKNLDGPNFWWGKLNWAVDRTGKWQELEYPSPAYPAFACGSGYVISKDIVK WLASNSGRLKTYQGEDVSMGIWMAAIGPKRYQDSLWLCEKTCETGMLSSPQYSPWELTEL WKLKERHTLPSMAATCGTEHLESGEYD >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_5|1704_bp atgcgaaactggctggtgctgctgtgcccgtgtgtgctcggggccgcgctgcacctctgg ctgcggctgcgctccccgccgcccgcctgcgcctccggggccggccctgcaggcctggtt agtgatgtccgacgccgcccgttcgcccgggagcggaacctcgtgtgcccttcagccgcc agcgctttctgctttgggagctctggctctgaagttgttttcttcccccccaagttgttg gtctttctcggtcaagactggggttttagtgccgtggataatgtgtatcagttggcctta tttcctcagtggaaatctactcactatgatgtggtagttggcgtgttgtcagctcgcaat aaccatgaacttcgaaacgtgataagaagcacctggatgagacatttgctacagcatccc acattaagtcaacgtgtgcttgtgaagttcataataggtgctcatggctgtgaagtgcct gtggaagacagggaggatccttattcctgtaaactactcaacatcacaaatccagttttg aatcaggaaattgaagcgttcagtctgtccgaagacacttcatcggggctgcctgaggat cgagttgtcagcgtgagtttccgagttctctaccccatcgttattaccagtcttggagtg ttctacgatgccaatgatgtgggtttccagaggaacatcactgtcaaactttatcaggca gaacaagaggaggccctcttcattgctcgcttcagtcctccaagctgtggtgtgcaggtg aacaagctgtggtacaagcccgtggaacaattcatcttaccagagagctttgaaggtaca atcgtgtgggagagccaagacctccacggccttgtgtcaagaaatctccacaaagtgaca gtgaatgatggagggggagttctcagagtcattacagctggggagggtgcattgcctcat gaattcttggaaggtgtggagggagttgcaggtggttttatatatactattcaggaaggt gatgctctcttacacaaccttcattctcgccctcaaagacttattgatcatataaggaat ctccatgaggaagatgccttactgaaggaggaaagcagcatctatgatgatattgttttt gtggatgttgtcgacacttatcgtaatgttcctgcaaaattattgaacttctatagatgg actgtggaaacaacgagcttcaatttgttgctgaagacagatgatgactgttacatagac ctcgaagctgtatttaataggattgtccaaaagaatctggatgggcctaatttttggtgg ggaaaactgaattgggcagttgaccgaaccggaaagtggcaggagttggagtacccgagc cccgcttaccctgcctttgcatgtgggtcaggatatgtgatctccaaggacatcgtcaag tggctggcaagcaactcggggaggttaaagacctatcagggtgaagatgtaagcatgggc atctggatggctgccataggacctaaaagataccaggacagtctgtggctgtgtgagaag acctgtgagacaggaatgctgtcttctcctcagtattctccgtgggaactgacggaactg tggaaactgaaggaacgccatacactgcccagtatggcagccacatgtggtactgagcac ctggaaagcggcgagtatgactga >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_6|29_aa MGLGPGPPTQCEDEDEDEDFYDDPLSLNE >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_6|90_bp atggggctggggcctggccctcctactcaatgtgaagatgaagatgaggatgaagacttt tatgatgatccactttcacttaatgaatag >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_7|169_aa MVSQAAADLLAYCEAHVREDPLIIPVPASENPFREKKRKPYSGSLRKVIHGCPKERLDID ELALGLSKINLKVLPKPNSYDPDERGEYTVGAELFVPQTHTGPRPGASLIHNLSEEQDIR KIGGLFKTLPLTSSSHFISSLTTYRYAFLTGFYSKDLIIEPANTSYTNA >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_7|510_bp atggtctcccaggcagctgcggacctcctggcctactgtgaagctcacgtgcgggaagat cctctcatcattccagtgcctgcatcagaaaacccctttcgcgagaagaagaggaagccg tattctggtagtttaaggaaggtgattcatggatgccccaaggaaaggctagacatcgat gaattagctcttggattgtctaagatcaacctgaaggtcctacctaaaccaaactcatat gatcccgatgaaagaggagaatacacagtgggagcagagctctttgtgccacagacccat actggtccgcggcctggggcgtccctcatccataacctcagtgaagaacaagacatccga aaaataggagggctattcaagactctacccctcacttcctcctcccattttatcagcagc ctcacaacttacaggtatgccttcctcacaggcttttactccaaagacctcattattgaa cctgcaaacacatcatacaccaatgcctga >gi568815597r:235350209_235604252|GENSCAN_predicted_peptide_8|151_aa MAVLSNENGDATSAEDLGWQKAIRIQYAPPLPMGSHPHKPITFMRLHIPPGPKKKNGSRN REELQQISTRTLATGNTDWALGFPEDEARGKRDHLLRGGTWGHVLTQLFLPSGFVVSLAS GVKPQTFVLSLSLKGPSSLNCEWGGSAHPFT >gi568815597r:235350209_235604252|GENSCAN_predicted_CDS_8|456_bp atggcggtgctatctaatgagaacggggatgctaccagtgctgaagatctgggttggcag aaagcaatacgcattcagtatgctcctccacttcccatggggtcacatccgcataaaccc atcacattcatgcgtctgcacatcccacctggacccaagaagaaaaatggaagtaggaac agagaggagctgcaacaaatctccacacgcacactggctaccggcaacactgactgggct ctcggctttccagaagatgaggcaagggggaaaagggaccatttgcttaggggtggcacc tggggccacgtgctcacacagctctttcttcccagtgggtttgtggtctcgctggcttca ggagtgaagccgcagaccttcgtgctatccctgagcctgaagggacccagcagcttaaac tgcgagtggggaggaagtgcacatccattcacatag