GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:35:24 Sequence gi568815587f:113875326_114089760 : 214435 bp : 46.74% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 335 120 216 2 0 66 60 217 0.105 13.39 1.00 Prom - 20210 20171 40 -4.46 2.00 Prom + 26246 26285 40 -6.46 2.01 Init + 29624 29660 37 1 1 53 109 94 0.489 6.28 2.02 Intr + 33970 34130 161 2 2 68 79 121 0.620 8.91 2.03 Intr + 56433 56542 110 2 2 75 81 98 0.359 6.88 2.04 Intr + 56964 57133 170 0 2 49 97 82 0.997 4.79 2.05 Intr + 57611 57768 158 0 2 77 51 159 0.895 10.83 2.06 Intr + 67657 67867 211 0 1 70 99 236 0.241 21.39 2.07 Intr + 69248 69430 183 0 0 98 89 160 0.997 16.86 2.08 Term + 70577 70812 236 0 2 70 49 194 0.999 10.18 2.09 PlyA + 71018 71023 6 1.05 3.06 PlyA - 71504 71499 6 1.05 3.05 Term - 72308 72165 144 2 0 40 33 70 0.296 -5.39 3.04 Intr - 72771 72725 47 1 2 79 80 33 0.379 -0.27 3.03 Intr - 74080 73968 113 0 2 27 105 124 0.538 8.02 3.02 Intr - 77291 77127 165 1 0 57 98 73 0.158 4.28 3.01 Init - 89308 89172 137 1 2 73 -5 156 0.094 4.41 3.00 Prom - 91530 91491 40 -4.16 4.00 Prom + 92205 92244 40 -3.56 4.01 Init + 100001 100067 67 1 1 56 121 186 0.988 17.93 4.02 Intr + 102446 102597 152 0 2 80 65 145 0.967 11.28 4.03 Intr + 103908 103952 45 2 0 105 99 101 0.997 11.51 4.04 Intr + 105878 105987 110 1 2 112 62 195 0.999 18.38 4.05 Intr + 107795 107964 170 2 2 84 81 282 0.963 26.69 4.06 Intr + 110633 110850 218 0 2 99 73 147 0.529 12.52 4.07 Intr + 111193 111403 211 0 1 91 87 359 0.656 34.59 4.08 Intr + 111500 111721 222 0 0 62 94 159 0.941 12.00 4.09 Term + 114140 114438 299 0 2 96 48 461 0.993 38.33 4.10 PlyA + 114965 114970 6 1.05 5.13 PlyA - 115057 115052 6 1.05 5.12 Term - 119733 119702 32 1 2 109 50 7 0.131 -2.98 5.11 Intr - 120745 120545 201 2 0 85 94 46 0.337 4.16 5.10 Intr - 125570 125372 199 2 1 93 47 104 0.259 5.72 5.09 Intr - 139735 139650 86 2 2 132 89 8 0.040 4.84 5.08 Intr - 140132 140047 86 1 2 67 16 51 0.014 -4.74 5.07 Intr - 146253 146152 102 2 0 15 119 121 0.839 7.19 5.06 Intr - 147795 147659 137 0 2 109 94 -21 0.835 -0.03 5.05 Intr - 149388 149321 68 1 2 100 90 58 0.887 5.82 5.04 Intr - 152231 152103 129 0 0 76 97 20 0.197 2.37 5.03 Intr - 154107 153967 141 1 0 51 107 29 0.117 1.42 5.02 Intr - 162864 162729 136 0 1 77 87 63 0.645 5.24 5.01 Init - 164864 164805 60 2 0 70 62 68 0.731 3.65 5.00 Prom - 166705 166666 40 -4.76 6.00 Prom + 170526 170565 40 -3.76 6.01 Sngl + 187976 189247 1272 1 0 99 47 1598 0.474 152.52 6.02 PlyA + 189359 189364 6 -0.45 7.00 Prom + 189866 189905 40 -4.06 7.01 Init + 195367 195424 58 0 1 89 77 60 0.293 6.47 7.02 Term + 201326 201405 80 0 2 95 45 57 0.146 0.03 7.03 PlyA + 204198 204203 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_1|72_aa MPGAGHVGQASGARCGRGRDGAGWRAGAGPSPERLGRRRLRRETGDPRLGASAMTAELQQ DDAAGAADGHGS >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_1|216_bp atgccgggggcggggcatgtgggccaagcctccggcgcgcgctgcgggcgggggcgggac ggagcggggtggagggcgggggcggggcctagtcctgagaggctgggccggcggcggctg cggcgggagaccggtgacccgcggctgggcgcctcggccatgactgcggagctgcagcag gacgacgcggccggcgcggcagacggccacggctcg >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_2|421_aa MAPLWACILVAAGILATDTHHPQDSALYHLSKQLLQKYHKEVRPVYNWTKATTVYLDLFV HAILDVVWNDEFLSWNSSMFDEIREISLPLSAIWAPDIIINEFVDIERYPDLPYVYVNSS GTIENYKPIQVVSACSLETYAFPFDVQNCSLTFKSILHTVEDVDLAFLRSPEDIQHDKKA FLNDSEWELLSVSSTYSILQSSAGGFAQIQFNVVMRRHPLVYVVSLLIPSIFLMLVDLGS FYLPPNCRARIVFKTSVLVGYTVFRVNMSNQVPRSVGSTPLIGHFFTICMAFLVLSLAKS IVLVKFLHDEQRGGQEQPFLCLRGDTDADRPRVEPRAQRAVVTESSLYGEHLAQPGTLKE VWSQLQSISNYLQTQDQTDQQEAEWLVLLSRFDRLLFQSYLFMLGIYTITLCSLWALWGG V >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_2|1266_bp atggctcccctgtgggcctgcatcctggtggctgcaggaattctagccacagatacacat catccccaggattctgctctgtatcatctcagcaagcagctattacagaaatatcataaa gaagtgagacctgtttacaactggaccaaggccaccacagtctacctggacctgttcgtc catgctatattggatgtggtctggaatgatgaatttttatcctggaactccagcatgttt gatgagattagagagatctccctacctctaagtgccatctgggcccccgatatcatcatc aatgagtttgtggacattgaaagataccctgaccttccctatgtttatgtgaactcatct gggaccattgagaactataagcccatccaggtggtctctgcgtgcagtttagagacatat gcttttccatttgatgtccagaattgcagcctgaccttcaagagcattctgcatacagtg gaagacgtagacctggcctttctgaggagcccagaagacattcagcatgacaaaaaggcg tttttgaatgacagtgagtgggaacttctatctgtgtcctccacatacagcatcctgcag agcagcgctggaggatttgcacagattcagtttaatgtggtgatgcgcaggcaccccctg gtctatgtcgtgagtctgctgattcctagcatctttctcatgctggtggacctggggagc ttctacctgccacccaactgccgagccaggattgtgttcaagaccagtgtgctggtgggc tacaccgtcttcagggtcaacatgtccaaccaggtgccacggagtgtagggagcacccct ctgattgggcacttcttcaccatctgcatggccttcttggttctcagcttagctaagtcc atcgtgttggtcaaattcctccatgatgagcagcgtggtggacaggagcagcccttcttg tgccttcgaggggacaccgatgctgacaggcctagagtggaacccagggcccaacgtgct gtggtaacagagtcctcgctgtatggagagcacctggcccagccaggaaccctgaaggaa gtctggtcgcagcttcaatctatcagcaactacctccaaactcaggaccagacagaccaa caggaggcagagtggctggtcctcctgtcccgctttgaccgactgctcttccaaagctac cttttcatgctggggatctacaccatcactctgtgctccctctgggcactgtggggcggc gtgtga >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_3|201_aa MREQRQAFSPIRSGNGGLTGSGAQQTTCWIQKDGSQRQVCNGGKQHLALFKEDEQGDQEL AVHMQPGLPVTLRRDDLPARLTHTPGLQRPTQAVGEVQRVRRRKCTGDQFGGQASLLKGC LLEADVKNLGNVICEWNKAAVEFHGQLALSTNWEKRKSGHRHEHAQRKGHVKTKQEVSNL QAKDRAQKKATHPVCTLILGF >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_3|606_bp atgcgggaacaaaggcaagcctttagcccaatcaggagtggcaatgggggcctcactgga tcaggagcacagcagacaacctgctggatccagaaggatggaagtcagcggcaggtctgc aacggcggcaaacagcacctggcgctcttcaaagaggatgagcagggtgaccaggagctg gctgtccacatgcagcccgggttaccagtgacactgaggagggatgaccttcccgctaga cttacacacaccccaggccttcaaaggccaacacaggctgtgggagaggtacagagggta agacgaaggaagtgtaccggggaccagtttggcggccaggcttcactcttaaagggctgc ttgttagaggctgatgtaaagaacctgggcaatgtcatctgcgaatggaacaaagcagca gtggaatttcatggacaattggcgctgagcactaactgggagaagagaaaatctggacac agacatgagcatgcacagaggaaaggtcatgtgaaaacaaagcaagaagtcagcaacctg caagccaaggacagggctcagaagaaagccacccaccctgtttgcaccttgatcttaggc ttctag >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_4|497_aa MLLWVQQALLALLLPTLLAQGEARRSRNTTRPALLRLSDYLLTNYRKGVRPVRDWRKPTT VSIDVIVYAILNVDEKNQVLTTYIWYRQYWTDEFLQWNPEDFDNITKLSIPTDSIWVPDI LINEFVDVGKSPNIPYVYIRHQGEVQNYKPLQVVTACSLDIYNFPFDVQNCSLTFTSWLH TRSSRLWVLDSKAGLLYSLSVQDINISLWRLPEKVKSDRSVFMNQGEWELLGVLPYFREF SMESSNYYAEMKFYVVIRRRPLFYVVSLLLPSIFLMVMDIVGFYLPPNSGERVSFKITLL LGYSVFLIIVSDTLPATAIGTPLIGVYFVVCMALLVISLAETIFIVRLVHKQDLQQPVPA WLRHLVLERIAWLLCLREQSTSQRPPATSQATKTDDCSAMGNHCSHMGGPQDFEKSPRDR CSPPPPPREASLAVCGLLQELSSIRQFLEKRDEIREVARDWLRVGSVLDKLLFHIYLLAV LAYSITLVMLWSIWQYA >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_4|1494_bp atgctgctgtgggtccagcaggcgctgctcgccttgctcctccccacactcctggcacag ggagaagccaggaggagccgaaacaccaccaggcccgctctgctgaggctgtcggattac cttttgaccaactacaggaagggtgtgcgccccgtgagggactggaggaagccaaccacc gtatccattgacgtcattgtctatgccatcctcaacgtggatgagaagaatcaggtgctg accacctacatctggtaccggcagtactggactgatgagtttctccagtggaaccctgag gactttgacaacatcaccaagttgtccatccccacggacagcatctgggtcccggacatt ctcatcaatgagttcgtggatgtggggaagtctccaaatatcccgtacgtgtatattcgg catcaaggcgaagttcagaactacaagccccttcaggtggtgactgcctgtagcctcgac atctacaacttccccttcgatgtccagaactgctcgctgaccttcaccagttggctgcac accaggtccagcaggctctgggtactagattccaaagctggcttgctttattctctctca gtccaggacatcaacatctctttgtggcgcttgccagaaaaggtgaaatccgacaggagt gtcttcatgaaccagggagagtgggagttgctgggggtgctgccctactttcgggagttc agcatggaaagcagtaactactatgcagaaatgaagttctatgtggtcatccgccggcgg cccctcttctatgtggtcagcctgctactgcccagcatcttcctcatggtcatggacatc gtgggcttctacctgccccccaacagtggcgagagggtctctttcaagattacactcctc ctgggctactcggtcttcctgatcatcgtttctgacacgctgccggccactgccatcggc actcctctcattggtgtctactttgtggtgtgcatggctctgctggtgataagtttggcc gagaccatcttcattgtgcggctggtgcacaagcaagacctgcagcagcccgtgcctgct tggctgcgtcacctggttctggagagaatcgcctggctactttgcctgagggagcagtca acttcccagaggcccccagccacctcccaagccaccaagactgatgactgctcagccatg ggaaaccactgcagccacatgggaggaccccaggacttcgagaagagcccgagggacaga tgtagccctcccccaccacctcgggaggcctcgctggcggtgtgtgggctgctgcaggag ctgtcctccatccggcaattcctggaaaagcgggatgagatccgagaggtggcccgagac tggctgcgcgtgggctccgtgctggacaagctgctattccacatttacctgctagcggtg ctggcctacagcatcaccctggttatgctctggtccatctggcagtacgcttga >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_5|458_aa MGLHQNGIYQPLDLGLLNLQGQGKTQNHIHKGPKFGQQVKFQTAPQLSHSGVWTQNVCPA SAPGIGVESSTGVPDAKLDGELFPLEGPHPALTLNFPFSAPDGGRRNTYTLPGCCGTWNA NFSPGSARSAGEMMQEACSWTSWLVQVTCSSIRSQVIQQPLGFLDDQQQPLSGSVRGKEI WGSAAHPCACGYPPLSSSLSHKSLKIPQGHAHAPQSFSYQCISRSGVDNRHFSKFPSDAD AAAQGPDIEATDFSGHGWCQQPSVALPQVQQKLQAVQVVPRGCLLGVHLMCLSVPGIQMG ILRVPVPIHQPSSTQGCCCSPDQRTGDLGAPCESPCAPRPSSTDEITGSCNDEAQYCTDE SLIYVKDISAEGAINRKAKYEVLIQRVCDEPYLNFVVQEASRFIHTNNNKTIVEICQPFT LCQTLYLYQLLYSSQQSIRQCFYDVTFTGPLRSTSKPV >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_5|1377_bp atgggccttcaccagaatggaatctaccagcccctggatcttggtcttctcaacctccag ggccagggcaagacccagaaccacatccacaagggccccaagtttggtcaacaggtcaag tttcaaacagccccacaattatcccatagtggggtgtggacccagaatgtttgccctgca tcagctcctgggattggtgttgaaagctccacaggtgttcctgatgcaaagctggatgga gaactatttcccctggaaggtccacacccagccctcactctaaacttccccttctcagct ccagatggtggaaggagaaatacctacactctccctggatgctgtgggacttggaatgca aatttctctcccggctctgcgaggtcagcaggagagatgatgcaagaggcctgcagctgg acctcctggctggtacaagtcacctgctccagcatccgatctcaagttattcagcagccc ctggggtttcttgatgaccaacagcagcccttaagtggtagtgtcagaggaaaggagatt tgggggtctgcagcccacccatgtgcatgtggctacccacctctctcctcctctctttct cacaaaagtctaaagatcccccagggccacgcccatgccccacagtcattcagttaccag tgcatctccaggtcaggggtggataaccggcatttcagcaagttcccaagtgatgcagat gctgctgctcagggaccagacattgaagccactgacttcagtggtcatggctggtgtcag cagccaagcgtagctctgccccaggtgcaacagaagctgcaggcagtgcaggtggtgcca cgtggctgccttctgggagtgcacctgatgtgcctttctgtgcctggaatccagatgggg atacttagagtccctgtccccatacaccaaccatcttcgacccaaggttgttgctgtagc cctgatcagaggacaggggatctgggggctccatgtgaatcaccttgtgctccaaggcca agttccactgatgaaatcacagggagctgcaatgatgaagcccagtactgtacagatgaa agtcttatttatgtgaaagatatatctgcagagggagcaataaaccgcaaagcaaagtat gaagtattaattcaacgggtctgtgatgaaccatacctcaactttgttgtgcaggaagct tctagatttatacatacgaacaataataaaaccatagttgaaatttgtcaaccatttact ttgtgccagaccctttacctgtatcagctcctttattcttcacagcaatccataaggcag tgcttttatgatgtcacctttacaggaccattgaggagcacctcaaagccagtgtga >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_6|423_aa MDLTKMGMIQLQNPSHPTGLLCKANQMRLAGTLCDVVIMVDSQEFHAHRTVLACTSKMFE ILFHRNSQHYTLDFLSPKTFQQILEYAYTATLQAKAEDLDDLLYAAEILEIEYLEEQCLK MLETIQASDDNDTEATMADGGAEEEEDRKARYLKNIFISKHSSEESGYASVAGQSLPGPM VDQSPSVSTSFGLSAMSPTKAAVDSLMTIGQSLLQGTLQPPAGPEEPTLAGGGRHPGVAE VKTEMMQVDEVPSQDSPGAAESSISGGMGDKVEERGKEGPGTPTRSSVITSARELHYGRE ESAEQVPPPAEAGQAPTGRPEHPAPPPEKHLGIYSVLPNHKADAVLSMPSSVTSGLHVQP ALAVSMDFSTYGGLLPQGFIQRELFSKLGELAVGMKSESRTIGEQCSVCGVELPDNEAVE QHR >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_6|1272_bp atggatctgacaaaaatgggcatgatccagctgcagaaccctagccaccccacggggcta ctgtgcaaggccaaccagatgcggctggccgggactttgtgcgatgtggtcatcatggtg gacagccaggagttccacgcccaccggacggtgctggcctgcaccagcaagatgtttgag atcctcttccaccgcaatagtcaacactatactttggacttcctctcgccaaagaccttc cagcagattctggagtatgcatatacagccacgctgcaagccaaggcggaggacctggat gacctgctgtatgcggccgagatcctggagatcgagtacctggaggaacagtgcctgaag atgctggagaccatccaggcctcagacgacaatgacacggaggccaccatggccgatggc ggggccgaggaagaagaggaccgcaaggctcggtacctcaagaacatcttcatctcgaag cattccagcgaggagagtgggtatgccagtgtggctggacagagcctccctgggcccatg gtggaccagagcccttcagtctccacttcatttggtctttcagccatgagtcccaccaag gctgcagtggacagtttgatgaccataggacagtctctcctgcagggaactcttcagcca cctgcagggcccgaggagccaactctggctgggggtgggcggcaccctggggtggctgag gtgaagacggagatgatgcaggtggatgaggtgcccagccaggacagccctggggcagcc gagtccagcatctcaggagggatgggggacaaggttgaggaaagaggcaaagaggggcct gggaccccgactcgaagcagcgtcatcaccagtgctagggagctacactatgggcgagag gagagtgccgagcaggtgccacccccagctgaggctggccaggcccccactggccgacct gagcacccagcacccccgcctgagaagcatctgggcatctactccgtgttgcccaaccac aaggctgacgctgtattgagcatgccgtcttccgtgacctctggcctccacgtgcagcct gccctggctgtctccatggacttcagcacctatggggggctgctgccccagggcttcatc cagagggagctgttcagcaagctgggggagctggctgtgggcatgaagtcagagagccgg accatcggagagcagtgcagcgtgtgtggggtcgagcttcctgataacgaggctgtggag cagcacaggtag >gi568815587f:113875326_114089760|GENSCAN_predicted_peptide_7|45_aa MPKIPPLEATTIFLDVNPKEGIKMVPKNNVKRLLNRVTLKTFFFP >gi568815587f:113875326_114089760|GENSCAN_predicted_CDS_7|138_bp atgcctaagataccaccattggaggctacaaccatcttccttgacgtcaatcctaaagaa gggataaagatggtacctaaaaacaatgtgaagcggctgctaaatcgagtgactctaaaa acatttttctttccttga