GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:28:57 Sequence gi568815597f:81736985_82091142 : 354158 bp : 35.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1604 1615 12 1 0 86 94 10 0.216 1.72 1.02 Intr + 20071 20146 76 0 1 72 80 74 0.239 3.17 1.03 Intr + 36511 36554 44 2 2 76 91 17 0.002 -2.16 1.04 Term + 63010 63147 138 0 0 116 42 162 0.822 11.38 1.05 PlyA + 63249 63254 6 1.05 2.04 PlyA - 64896 64891 6 -0.45 2.03 Term - 65750 65530 221 0 2 87 38 202 0.568 11.32 2.02 Intr - 66075 65752 324 1 0 76 13 230 0.560 9.12 2.01 Init - 71386 71326 61 1 1 79 84 26 0.933 2.76 2.00 Prom - 72968 72929 40 -1.35 3.05 PlyA - 73198 73193 6 1.05 3.04 Term - 106984 106891 94 0 1 38 55 138 0.794 1.82 3.03 Intr - 121725 121605 121 1 1 77 94 101 0.877 8.23 3.02 Intr - 125804 125763 42 0 0 74 103 40 0.477 1.39 3.01 Init - 134258 134111 148 2 1 87 23 87 0.461 2.40 3.00 Prom - 134435 134396 40 -3.65 4.00 Prom + 152143 152182 40 -7.05 4.01 Init + 152547 152667 121 2 1 41 24 126 0.007 1.90 4.02 Intr + 170033 170246 214 0 1 99 94 234 0.942 21.95 4.03 Intr + 199744 199853 110 1 2 99 98 69 0.916 8.01 4.04 Intr + 205985 206785 801 0 0 54 121 582 0.793 48.60 4.05 Intr + 213205 213498 294 2 0 96 53 129 0.963 6.26 4.06 Intr + 214034 214137 104 0 2 80 81 24 0.997 -0.13 4.07 Intr + 214973 215158 186 1 0 54 75 210 0.999 15.26 4.08 Intr + 218893 219076 184 0 1 34 111 182 0.999 13.54 4.09 Intr + 229074 229199 126 1 0 87 91 161 0.999 16.03 4.10 Intr + 229420 229625 206 2 2 90 95 116 0.998 10.50 4.11 Intr + 231042 231215 174 2 0 82 92 60 0.928 5.01 4.12 Intr + 232194 232403 210 2 0 88 103 89 0.986 8.59 4.13 Intr + 233330 233550 221 1 2 106 94 121 0.699 10.68 4.14 Intr + 243811 243855 45 1 0 59 82 75 0.567 0.61 4.15 Intr + 244824 244992 169 0 1 86 106 17 0.935 2.33 4.16 Intr + 247599 247727 129 2 0 53 92 72 0.855 4.07 4.17 Intr + 248275 248371 97 0 1 45 86 24 0.955 -3.44 4.18 Intr + 249917 250045 129 0 0 101 69 45 0.923 3.65 4.19 Term + 253407 254161 755 1 2 147 37 695 0.999 62.91 4.20 PlyA + 254202 254207 6 1.05 5.00 Prom + 283152 283191 40 -3.75 5.01 Init + 286639 286842 204 0 0 39 83 117 0.049 5.20 5.02 Term + 309571 309762 192 0 0 77 38 198 0.731 10.14 5.03 PlyA + 311277 311282 6 1.05 6.00 Prom + 331274 331313 40 -6.05 6.01 Init + 333751 333878 128 0 2 72 109 165 0.994 16.68 6.02 Intr + 335165 335312 148 2 1 71 34 79 0.577 0.12 6.03 Term + 335488 335562 75 0 0 117 54 70 0.706 3.36 6.04 PlyA + 337114 337119 6 1.05 7.03 PlyA - 339135 339130 6 1.05 7.02 Term - 349086 349018 69 0 0 97 50 71 0.613 1.16 7.01 Intr - 350053 349903 151 0 1 91 32 121 0.804 6.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 50098 49919 180 1 0 70 45 157 0.924 8.83 S.002 Init + 57394 57471 78 0 0 50 64 49 0.824 -0.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_1|89_aa MAVQLMFAITRDCVGVQPTKWNWLVKAFVEASRRKKDCPQLLLMLRSIVGSAWLFADIAN REGQGSVFRGQEQRQFQCKEMRETDLTAL >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_1|270_bp atggcagttcagctaatgtttgctatcaccagagactgtgttggagtacagccaaccaaa tggaattggctagtaaaagcttttgtggaggcaagtcgaaggaaaaaggattgcccacag ttacttcttatgcttaggtccatcgtaggctctgcctggctgtttgctgacattgccaac cgagaggggcaaggaagtgtgttcagaggacaagagcaaaggcagtttcaatgtaaggaa atgagggaaacagacctaacagctctgtga >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_2|201_aa MVTFDIIKQCRHCWRVPTNQGPTRPAAALSLRLREALCPPRGPPGPRPLHPPDPRGRPGS SSPAAPNRSQSRRRPDHIWLRFTVILASPHHLTTDEKEQQPSKTRPSPPPPSTSPELAPA AQPPRSTYMRDRRAKRAGRWGGPGPQRPEQISSAEDALHTPPPQAPSERTGGTLRPQAFM KLKNAWLREPHPNPLASAPGN >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_2|606_bp atggtaacttttgatattattaagcaatgtagacactgctggcgagttcctacaaaccaa ggcccaacgcggcccgctgctgccctctcactgcggctccgggaggctctctgccctccc cgtggcccgccgggtccccggcccctgcaccctcccgacccacgaggccgccctggcagc agcagccccgccgctcccaatcgtagccagtcccggcggcggcccgaccacatctggctg cgctttactgtcatcctcgcctccccacaccacctaaccactgacgagaaggaacagcaa ccctcaaaaactcgtccaagccctcccccaccgtccacttcccccgaactcgctccagcc gctcagcctccccgctccacttacatgcgggaccggagggcaaagcgggcgggacgctgg ggtggccccgggccccagcgcccggagcaaataagctccgccgaggacgcgcttcacact cctcctccccaggcccccagtgaaaggacgggcggaaccttgcgaccccaagcttttatg aaattaaaaaatgcatggctacgggagcctcatcctaacccccttgccagcgcccccggc aactag >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_3|134_aa MDEAGNHHSQQTVTRTENQTPHVLTHRWELNSENTWTQDGEHHTTGPVGDVFQCMGDTGR GESECAKFSQSLTEVLNPNSDTLRNNTKSNTDPVQSVEAEHLKRPTGESTKTVDKQSFST NEKEAKEDGIKKKA >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_3|405_bp atggacgaagctggaaaccatcattctcagcaaactgtcacaaggacagaaaaccaaaca ccgcatgttctcactcatagatgggaactgaacagtgagaacacttggacacaggatggg gaacatcacacaacagggcctgttggggatgtgtttcagtgcatgggagacactggtaga ggcgaaagtgagtgtgctaaattttcacagtccctgacggaagtactcaaccccaattct gatactttaagaaataacaccaagtcaaatactgaccctgtgcagtcggtggaagctgaa catctaaagagacctactggtgaatcaacgaagactgtggataaacaaagtttttccaca aatgaaaaagaggcaaaagaggatggaatcaagaagaaagcatga >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_4|1424_aa MGLKERSHCSHNIEVGGKAASVDIEAAASYAEDLAKIIDGGFSRAALPFGLVRRELSCEG YSIDLRCPGSDVIMIESANYGRTDDKICDADPFQMENTDCYLPDAFKIMTQRCNNRTQCI VVTGSDVFPDPCPGTYKYLEVQYECVPYIFVCPGTLKAIVDSPCIYEAEQKAGAWCKDPL QAADKIYFMPWTPYRTDTLIEYASLEDFQNSRQTTTYKLPNRVDGTGFVVYDGAVFFNKE RTRNIVKFDLRTRIKSGEAIINYANYHDTSPYRWGGKTDIDLAVDENGLWVIYATEQNNG MIVISQLNPYTLRFEATWETVYDKRAASNAFMICGVLYVVRSVYQDNESETGKNSIDYIY NTRLNRGEYVDVPFPNQYQYIAAVDYNPRDNQLYVWNNNFILRYSLEFGPPDPAQVPTTA VTITSSAELFKTIISTTSTTSQKGPMSTTVAGSQEGSKGTKPPPAVSTTKIPPITNIFPL PERFCEALDSKGIKWPQTQRGMMVERPCPKGTRGTASYLCMISTGTWNPKGPDLSNCTSH WVNQLAQKIRSGENAASLANELAKHTKGPVFAGDVSSSVRLMEQLVDILDAQLQELKPSE KDSAGRSYNKAIVDTVDNLLRPEALESWKHMNSSEQAHTATMLLDTLEEGAFVLADNLLE PTRVSMPTENIVLEVAVLSTEGQIQDFKFPLGIKGAGSSIQLSANTVKQNSRNGLAKLVF IIYRSLGQFLSTENATIKLGADFIGRNSTIAVNSHVISVSINKESSRVYLTDPVLFTLPH IDPDNYFNANCSFWNYSERTMMGYWSTQGCKLVDTNKTRTTCACSHLTNFAILMAHREIA YKDGVHELLLTVITWVGIVISLVCLAICIFTFCFFRGLQSDRNTIHKNLCINLFIAEFIF LIGIDKTKYAIACPIFAGLLHFFFLAAFAWMCLEGVQLYLMLVEVFESEYSRKKYYYVAG YLFPATVVGVSAAIDYKSYGTEKANYRVCDGYYNTDLPGSWVLGAFALLCLLGLTWSFGL LFINEETIVMAYLFTIFNAFQGVFIFIFHCALQKKVRKEYGKCFRHSYCCGGLPTESPHS SVKASTTRTSARYSSGTQSRIRRMWNDTVRKQSESSFISGDINSTSTLNQGMTGNYLLTN PLLRPHGTNNPYNTLLAETVVCNAPSAPVFNSPGHSLNNARDTSAMDTLPLNGNFNNSYS LHKGDYNDSVQVVDCGLSLNDTAFEKMIISELVHNNLRGSSKTHNLELTLPVKPVIGGSS SEDDAIVADASSLMHSDNPGLELHHKELEAPLIPQRTHSLLYQPQKKVKSEGTDSYVSQL TAEAEDHLQSPNRDSLYTSMPNLRDSPYPESSPDMEEDLSPSRRSENEDIYYKSMPNLGA GHQLQMCYQISRGNSDGYIIPINKEGCIPEGDVREGQMQLVTSL >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_4|4275_bp atggggctgaaggaaagaagccattgttcccataacatagaagtgggaggtaaagcagca agtgttgatatagaagctgcagcaagttatgcagaagatctagctaagatcattgatgga ggtttcagcagagcagctttaccatttgggctggtgaggcgagaattatcctgtgaaggt tattctatagatctgcgatgcccgggcagtgatgtcatcatgattgagagcgctaactat ggtcggacggatgacaagatttgtgatgctgacccatttcagatggagaatacagactgc tacctccccgatgccttcaaaattatgactcaaaggtgcaacaatcgaacacagtgtata gtagttactgggtcagatgtgtttcctgatccatgtcctggaacatacaaataccttgaa gtccaatatgaatgtgtcccttacatttttgtgtgtcctgggaccttgaaagcaattgtg gactcaccatgtatatatgaagctgaacaaaaggcgggtgcttggtgcaaggaccctctt caggctgcagataaaatttatttcatgccctggactccctatcgtaccgatactttaata gaatatgcttctttagaagatttccaaaatagtcgccaaacaacaacatataaacttcca aatcgagtagatggtactggatttgtggtgtatgatggtgctgtcttctttaacaaagaa agaacgaggaatattgtgaaatttgacttgaggactagaattaagagtggcgaggccata attaactatgccaactaccatgatacctcaccatacagatggggaggaaagactgatatc gacctagcagttgatgaaaatggtttatgggtcatttacgccactgaacagaacaatgga atgatagttattagccagctgaatccatacactcttcgatttgaagcaacgtgggagact gtatacgacaaacgtgccgcatcaaatgcttttatgatatgcggagtcctctatgtggtt aggtcagtttatcaagacaatgaaagtgaaacaggcaagaactcaattgattacatttat aatacccgattaaaccgaggagaatatgtagatgttcccttccccaaccagtatcagtat attgctgcagtggattacaatccaagagataaccaactttacgtgtggaacaataacttc attttacgatattctctggagtttggtccacctgatcctgcccaagtgcctaccacagct gtgacaataacttcttcagctgagctgttcaaaaccataatatcaaccacaagcactact tcacagaaaggccccatgagcacaactgtagctggatcacaggaaggaagcaaagggaca aaaccacctccagcagtttctacaaccaaaattccacctataacaaatatttttcccctg ccagagagattctgtgaagcattagactccaaggggataaagtggcctcagacacaaagg ggaatgatggttgaacgaccatgccctaagggaacaagaggaactgcctcatatctctgc atgatttccactggaacatggaaccctaagggccccgatcttagcaactgtacctcacac tgggtgaatcagctggctcagaagatcagaagcggagaaaatgctgctagtcttgccaat gaactggctaaacataccaaagggccagtgtttgctggggatgtaagttcttcagtgaga ttgatggagcagttggtggacatccttgatgcacagctgcaggaactgaaacctagtgaa aaagattcagctggacggagttataacaaggcaattgttgacacagtggacaaccttctg agacccgaagctttggaatcatggaaacatatgaattcttctgaacaagcacatactgca acaatgttactcgatacattggaagaaggagcttttgtcctagctgacaatcttttagaa ccaacaagggtctcaatgcccacagaaaatattgtcctggaagttgccgtactcagtaca gaaggacagatccaagactttaaatttcctctgggcatcaaaggagcaggcagctcaatc caactgtccgcaaataccgtcaaacagaacagcaggaatgggcttgcaaagttggtgttc atcatttaccggagcctgggacagttccttagtacagaaaatgcaaccattaaactgggt gctgattttattggtcgtaatagcaccattgcagtgaactctcacgtcatttcagtttca atcaataaagagtccagccgagtatacctgactgatcctgtgctttttaccctgccacac attgatcctgacaattatttcaatgcaaactgctccttctggaactactcagagagaact atgatgggatattggtctacccagggctgcaagctggttgacactaataaaactcgaaca acgtgtgcatgcagccacctaaccaattttgcaattctcatggcccacagggaaattgca tataaagatggcgttcatgaattacttcttacagtcatcacctgggtgggaattgtcatt tcccttgtttgcctggctatctgcatcttcaccttctgctttttccgtggcctacagagt gaccgaaatactattcacaagaacctttgtatcaaccttttcattgctgaatttattttc ctaataggcattgataagacaaaatatgcgattgcatgcccaatatttgcaggacttcta cactttttctttttggcagcttttgcttggatgtgcctagaaggtgtgcagctctaccta atgttagttgaagtttttgaaagtgaatattcaaggaaaaaatattactatgttgctggt tacttgtttcctgccacagtggttggagtttcagctgctattgactataagagctatgga acagaaaaagctaattaccgtgtttgtgatggctactataatacggacttacctgggtct tgggtgcttggcgctttcgctcttctgtgtcttcttggcctcacctggtcctttgggttg ctttttattaatgaggagactattgtgatggcatatctcttcactatatttaatgctttc cagggagtgttcattttcatctttcactgtgctctccaaaagaaagtacgaaaagaatat ggcaagtgcttcagacactcatactgctgtggaggcctcccaactgagagtccccacagt tcagtgaaggcatcaaccaccagaaccagtgctcgctattcctctggcacacagagtcgt ataagaagaatgtggaatgatactgtgagaaaacaatcagaatcttcttttatctcaggt gacatcaatagcacttcaacacttaatcaaggaatgactggcaattacctactaacaaac cctcttcttcgaccccacggcactaacaacccctataacacattgctcgctgaaacagtt gtatgtaatgccccttcagctcctgtatttaactcaccaggacattcactgaacaatgcc agggatacaagtgccatggatactctaccgctaaatggtaattttaacaacagctactcg ctgcacaagggtgactataatgacagcgtgcaagttgtggactgtggactaagtctgaat gatactgcttttgagaaaatgatcatttcagaattagtgcacaacaacttacggggcagc agcaagactcacaacctcgagctcacgctaccagtcaaacctgtgattggaggtagcagc agtgaagatgatgctattgtggcagatgcttcatctttaatgcacagcgacaacccaggg ctggagctccatcacaaagaactcgaggcaccacttattcctcagcggactcactccctt ctgtaccaaccccagaagaaagtgaagtccgagggaactgacagctatgtctcccaactg acagcagaggctgaagatcacctacagtcccccaacagagactctctttatacaagcatg cccaatcttagagactctccctatccggagagcagccctgacatggaagaagacctctct ccctccaggaggagtgagaatgaggacatttactataaaagcatgccaaatcttggagct ggccatcagcttcagatgtgctaccagatcagcaggggcaatagtgatggttatataatc cccattaacaaagaagggtgtattccagaaggagatgttagagaaggacaaatgcagctg gttacaagtctttaa >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_5|131_aa MRSSWIIPVGSKSNDKCRYKRHRRDAQRRAGHMKTEAETGVMQPQTKECFQPSEAGRGKE GYPLRAFEVVTSFFYSTSILSTDKVLGSEIDVGKSKMNKADPQLPQDTLTDIDVENETRE TSNSFKAFLSG >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_5|396_bp atgagatcatcctggattatcccagtgggctctaaatccaatgacaagtgtcgttataag agacacaggagagatgcacagagaagagcaggccacatgaagacggaggccgagactgga gtgatgcagccacagaccaaggaatgcttccagccatcagaagctggaaggggcaaggaa ggatatccccttagagccttcgaggttgtgacttcattcttttattctacaagcatcctg agcactgacaaagttttgggctctgagatagatgttggaaagtcaaagatgaataaggct gatcctcaacttccacaagatacactcacagacatagatgtagaaaatgagactcgagag acgagcaactcgttcaaggctttcttaagtggctaa >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_6|116_aa MVYGASEAIGQHQSLAAKPRRSGKESVREPWARVPGALGMAARKAGLAAKGEGEGVEGYL PLPQESREGVETRREGVEVLAPSPEKRDLPLRKMKEIEIKRRERLKSGKDKVVEGQ >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_6|351_bp atggtctacggggcttccgaggcgatcgggcagcatcagtctttagccgctaagccgaga agatctgggaaggagtcagtcagagagccttgggccagagttccaggggctctgggaatg gctgccaggaaagcgggacttgccgctaagggtgaaggagaaggggttgaggggtacttg cccctgccccaggaaagcagagaaggggtagagacaaggagagaaggggttgaggtactt gccccttccccagaaaagcgggacttgccgctaaggaaaatgaaagaaattgaaattaag agaagggagagattgaagagtggaaaggataaagtggttgagggacagtga >gi568815597f:81736985_82091142|GENSCAN_predicted_peptide_7|73_aa XKIIFHNDSLTVLPLIIISEAFRDRICGVLSSDANRMTKQQPHELIESQGEVKTGAKPLA PLALQLADLSCRL >gi568815597f:81736985_82091142|GENSCAN_predicted_CDS_7|222_bp nnaaaaatcattttccacaatgactcactgacagttctccctctaatcatcatttctgaa gcatttagggacaggatctgtggtgtcctctcctcagatgccaacagaatgacaaagcag cagccacatgagctgatagagtcacaaggagaggttaagactggagctaaaccattggct cccctggctctccagcttgctgacttatcctgcagactttga