GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:35:20 Sequence gi568815596f:100902702_101106071 : 203370 bp : 45.20% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2032 2085 54 1 0 125 94 71 0.929 9.49 1.02 Intr + 6685 6772 88 1 1 98 52 16 0.489 -1.03 1.03 Intr + 11235 11292 58 2 1 105 37 51 0.318 0.16 1.04 Intr + 11546 11675 130 0 1 98 40 60 0.213 1.95 1.05 Intr + 22445 22593 149 2 2 101 58 234 0.812 21.58 1.06 Intr + 29887 29988 102 2 0 88 81 79 0.975 7.35 1.07 Intr + 30209 30300 92 0 2 86 52 65 0.947 2.41 1.08 Intr + 35052 35141 90 2 0 59 117 68 0.897 6.99 1.09 Term + 36779 36850 72 1 0 68 39 86 0.690 -0.39 1.10 PlyA + 38540 38545 6 1.05 2.06 PlyA - 39141 39136 6 1.05 2.05 Term - 40458 40167 292 2 1 80 53 201 0.899 10.62 2.04 Intr - 40817 40778 40 0 1 68 39 83 0.510 -1.32 2.03 Intr - 46304 46173 132 0 0 61 77 81 0.273 4.92 2.02 Intr - 48968 48912 57 0 0 92 103 3 0.207 1.06 2.01 Init - 50504 50456 49 2 1 86 58 32 0.179 -0.89 2.00 Prom - 52453 52414 40 -2.06 3.00 Prom + 58887 58926 40 -0.56 3.01 Init + 61757 61773 17 1 2 66 72 6 0.245 -4.44 3.02 Intr + 62959 63065 107 1 2 105 84 87 0.879 9.86 3.03 Intr + 63509 63667 159 0 0 126 20 31 0.286 0.06 3.04 Intr + 65580 65727 148 1 1 50 86 250 0.601 20.29 3.05 Intr + 68289 68373 85 0 1 65 107 30 0.422 2.42 3.06 Intr + 72102 72243 142 2 1 83 76 112 0.909 9.43 3.07 Intr + 72757 72866 110 2 2 22 131 40 0.087 1.70 3.08 Intr + 75009 75098 90 2 0 104 109 12 0.799 5.09 3.09 Intr + 79530 79676 147 2 0 5 58 248 0.670 13.93 3.10 Intr + 80037 80267 231 2 0 74 26 101 0.414 0.47 3.11 Intr + 85378 85575 198 0 0 80 99 131 0.577 12.95 3.12 Intr + 87693 87745 53 2 2 93 96 95 0.274 8.51 3.13 Intr + 88079 88171 93 2 0 59 70 67 0.645 1.08 3.14 Intr + 90646 90826 181 1 1 79 94 162 0.997 15.77 3.15 Intr + 92699 92833 135 1 0 122 44 41 0.299 3.86 3.16 Intr + 99058 99186 129 0 0 87 -20 122 0.236 2.19 3.17 Intr + 100001 100107 107 1 2 71 65 205 0.964 15.51 3.18 Intr + 101457 101582 126 0 0 56 43 209 0.998 12.99 3.19 Intr + 103258 103370 113 1 2 54 106 124 0.996 10.82 3.20 Term + 103649 103680 32 0 2 70 37 71 0.977 -1.78 3.21 PlyA + 103699 103704 6 1.05 4.19 PlyA - 103923 103918 6 -1.75 4.18 Term - 105572 105120 453 2 0 64 43 307 0.990 18.96 4.17 Intr - 108325 108228 98 2 2 99 80 129 0.990 12.93 4.16 Intr - 119820 119580 241 0 1 51 76 341 0.157 26.32 4.15 Intr - 124750 124682 69 1 0 55 107 90 0.821 6.98 4.14 Intr - 125445 125314 132 0 0 113 24 174 0.623 14.34 4.13 Intr - 125731 125602 130 0 1 13 82 162 0.981 8.80 4.12 Intr - 127075 126790 286 2 1 98 80 345 0.871 31.10 4.11 Intr - 129684 129567 118 0 1 42 22 260 0.686 14.84 4.10 Intr - 131057 130843 215 0 2 47 67 253 0.856 17.53 4.09 Intr - 133467 133317 151 0 1 35 115 173 0.994 14.44 4.08 Intr - 135007 134831 177 1 0 123 64 163 0.995 17.52 4.07 Intr - 135954 135649 306 0 0 90 23 446 0.500 34.95 4.06 Intr - 137684 137477 208 1 1 119 59 201 0.974 19.38 4.05 Intr - 147940 147700 241 2 1 86 52 426 0.331 35.41 4.04 Intr - 151635 151407 229 0 1 56 111 297 0.864 26.24 4.03 Intr - 156838 156726 113 2 2 80 98 62 0.519 6.50 4.02 Intr - 187663 187508 156 2 0 72 105 72 0.712 7.28 4.01 Intr - 187757 187692 66 0 0 59 55 84 0.131 1.08 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 159142 158974 169 1 1 70 59 59 0.855 0.35 S.002 Term + 171504 171602 99 2 0 84 42 86 0.862 1.73 S.003 Sngl + 196564 196914 351 0 0 81 38 162 0.962 6.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:100902702_101106071|GENSCAN_predicted_peptide_1|278_aa XKNCIENLMDEDEKDRAKRESNVAKKRDCSRFICTISLDINSVGLKAKASQKDLIPIPAT AMHLGQLAFLKYHRHSPFVRVWGYCARTRLDPELQVQPSSGFQSSRELRLRASRNKSEKK RRDQFNVLIKELSSMLPGNTRKMDKTTVLEKVIGFLQKHNGGNYLIPTVDDGVRHVDIQG HGTTCKGRHHSHREEVSAQTEICDIQQDWKPSFLSNEEFTQLMLEALDGFIIAVTTDGSI IYVSDSITPLLGHLPEASDGADTMNTEVIPIITPEKLG >gi568815596f:100902702_101106071|GENSCAN_predicted_CDS_1|837_bp nngaaaaactgcatagaaaatctaatggatgaagatgagaaagacagagccaagagggag agtaatgtggcaaagaagagagactgttctcgtttcatttgtaccatttctttggatata aattctgtaggtctgaaagcaaaggcctctcagaaggacctgatccccatcccagctaca gccatgcatcttgggcagctggccttcctgaaataccacagacacagcccctttgtaaga gtatggggctactgtgctaggactcgcttggatcctgagctgcaagtacaaccctcctct ggattccagagttctagagagctacgcctgagagcttctcgaaacaagtctgagaagaag cgtcgggaccagttcaatgttctcatcaaagagctcagttccatgctccctggcaacacg cggaaaatggacaaaaccaccgtgttggaaaaggtcatcggatttttgcagaaacacaat ggtggtaactacctaattccaactgttgacgatggagttcgtcatgtagacattcagggc catggtaccacctgcaagggcaggcaccacagccatagagaagaagtctcagcgcaaacg gaaatctgtgacattcagcaagactggaagccttcattcctcagtaatgaagaattcacc cagctgatgttggaggcattagatggcttcattatcgcagtgacaacagacggcagcatc atctatgtctctgacagtatcacgcctctccttgggcatttaccggaagctagtgatggg gctgacaccatgaatacagaagtgatccccatcatcaccccagagaagctgggctga >gi568815596f:100902702_101106071|GENSCAN_predicted_peptide_2|189_aa MGFRHVGQAGFKLLTSAPGTHHSTFCFYEFGYGTSLKSDVRQDRLFMENGHRLALMQAAL LPHAIVYASLSDPWFTRHQVSAVESSDIRITLGADWCHHYGIILVLLSKGGGRTSLNPAI LHLVSPPGMLANVHQETLQDVHGSTFQESLTGMNAGIAAHSYDGTHPAIRRAHSSYICQC EADAKHKRC >gi568815596f:100902702_101106071|GENSCAN_predicted_CDS_2|570_bp atggggtttcgccatgttggccaggctggtttcaaactcctgacctcagcccctggcact caccattctactttctgtttctatgaatttggctatggtacctcattaaaatcagatgta cgacaggacagattattcatggaaaatggacacaggcttgcactgatgcaagcagctttg ctacctcatgcaattgtgtatgcttcattaagtgacccgtggttcacgaggcatcaagtg tcagctgtggagtcttcagacatccgcattaccctaggggcagactggtgccaccactat ggcatcattctggtgttacttagtaaaggtggaggcagaacttccctcaacccagccatt ctacatctagtcagtcctccagggatgcttgcaaacgttcatcaggaaacactacaggat gttcatggaagcaccttccaagaatccctaactgggatgaatgcaggaattgcagcacat tcctacgatggaacacacccagcaataagaagagcccacagcagctacatctgccagtgt gaggcagatgcgaagcacaaacgatgttga >gi568815596f:100902702_101106071|GENSCAN_predicted_peptide_3|800_aa MQARPGAPPIIGYLPFEVLGTSGYDYYHIDDLELLARCHQHRQVFWESQKWVTLWDVVGA LKGMYTAYWGTQEDPHSWGSGKALCHIRGEHISKLMQFGKGKSCCYRFLTKGQQWIWLQT HYYITYHQWNSKPEFIVCTHSVVSYADVRVERRQELALEDPPSEALHSSALKDKGSSLEP RQHFNTLDVGASGLNTSHSPSASSRSSHKSSHTAMSEPTSTPTKLMAEASTPALPRSATL PQELPVPGLSQAATMPAPLPSPSSCDLTQQLLPQTVLQSTPAPMAQFSAQFSMFQTIKDQ LEQRTRILQANIRWQQEELHKIQEQLCLVQDSNVQLPAQAAIRRRWKQLCCFSSNAGIQG DVGNDDIHITAYWRLQRELGGSPFVVCQQGSSAKPSSSSSHCDPQILELFGEMFLQQPAV SLSFSSTQRPEAQQQLQQRSAAVTQPQLGAGPQLPGQISSAQVTSQHLLRESSVISTQDA SQCQPSPDFSHDRQLRLLLSQPIQPMMPGSCDARQPSEVSRTGRQVKYAQSQTVFQNPDA HPANSSSAPMPVLLMGQAVLHPSFPASQPSPLQPAQARQQPPQHYLQVQAPTSLHSEQQD SLLLSTYSQQPGTLGYPQPPPAQPQPLRPPRRVISNTLGHPRTGTTRQPCGSTAQDSHDW QSCAGPARRKHLQRRMAPAKKGGEKKKGRSAINEVVTREYTINIHKRIHGVGFKKRAPRA LKEIRKFAMKEMGTPDVRIDTRLNKAVWAKGIRNVPYRIRVRLSRKRNEDEDSPNKLYTL VTYVPVTTFKNLQTVNVDEN >gi568815596f:100902702_101106071|GENSCAN_predicted_CDS_3|2403_bp atgcaggccaggcctggagcacctccaatcataggatacctgccttttgaagtgctggga acctcaggctatgactactaccacattgatgacctggagctcctggccaggtgtcaccag caccggcaggtcttctgggaaagccagaaatgggtgactttgtgggatgtggtcggtgca ttgaaggggatgtacacagcatactgggggactcaagaggaccctcacagctggggttca gggaaggctctttgccacatacgaggcgaacacattagcaaattgatgcagtttggcaaa gggaagtcgtgttgctaccggtttctgaccaaaggtcagcagtggatctggctgcagact cactactacatcacctaccatcagtggaactccaagcccgagttcatcgtgtgcacacac tcggtggtcagttacgcagatgtccgggtggaaaggaggcaggagctggctctggaagac ccgccatccgaggccctccactcctcagcactaaaggacaagggctcaagcctggaacct cggcagcactttaacacactcgacgtgggtgcctcgggccttaataccagtcattcgcca tcggcgtcctcaagaagttcccacaaatcctcgcacacagccatgtcagaacccacctcc actcccaccaagctgatggcagaggccagcaccccggctttgccaagatcagccaccctg ccccaagagttacctgtccccgggctcagccaggcagccaccatgccggcccctctgcct tccccatcgtcctgcgacctcacacagcagctcctgcctcagaccgttctgcagagcacg cccgctcccatggcacagttttcggcacagttcagcatgttccagaccatcaaagaccag ctagagcagcggacgcggatcctgcaggccaatatccggtggcaacaggaagagctccac aagatccaggagcagctctgcctggtccaggactccaacgtccagctcccagcccaggct gccatcaggaggagatggaagcagctctgttgtttctcatccaacgcaggcatccaggga gatgtgggaaatgatgatatccatatcacagcctactggagactgcagagggagctgggt ggctccccatttgtggtgtgccagcagggaagctctgcaaagcctagcagcagcagctct cactgtgacccccagatattagaattgtttggagagatgttcctgcagcagccagctgta tccctgagcttcagcagcacccagcgacctgaggctcagcagcagctacagcaaaggtca gctgcagtgactcagccccagctcggggcgggcccccaacttccagggcagatctcctct gcccaggtcacaagccagcacctgctcagagaatcaagtgtgatatcaacccaggatgcc agccagtgccagcccagcccagacttcagccatgatcggcagctcaggctgttgctgagc cagcccatccagcccatgatgcccgggtcctgtgacgcaaggcagccctcggaagtcagc aggacgggacggcaagtcaagtacgcccagagccagaccgtgtttcaaaatccagacgca caccccgccaacagcagcagcgccccgatgcccgtcctgctgatggggcaggcggtgctc caccccagcttccctgcctcccaaccatcgcccctgcagcctgcacaggcccggcagcag ccaccgcagcactacctgcaggtacaggcaccaacctctttgcacagtgagcagcaggac tcgctacttctctccacctactcacaacagccagggaccctgggctacccccaaccaccc ccagcacagccccagcccctacgtcctccccgaagggtcatatccaacacactgggccac ccacgcacagggacgacgcgacagccctgtggctccaccgcacaggacagccacgactgg caatcctgtgccggccctgctcgacgcaagcacctccagcgccggatggctcccgcaaag aagggtggcgagaagaaaaagggccgttctgccatcaacgaagtggtaacccgagaatac accatcaacattcacaagcgcatccatggagtgggcttcaagaagcgtgcacctcgggca ctcaaagagattcggaaatttgccatgaaggagatgggaactccagatgtgcgcattgac accaggctcaacaaagctgtctgggccaaaggaataaggaatgtgccataccgaatccgt gtgcggctgtccagaaaacgtaatgaggatgaagattcaccaaataagctatatactttg gttacctatgtacctgttaccactttcaaaaatctacagacagtcaatgtggatgagaac taa >gi568815596f:100902702_101106071|GENSCAN_predicted_peptide_4|1129_aa XWSPDSDHMPHTHEELASPMLTGRLVGALDAVLDSNARVAPFRILLQVPGSQVYSPIACG ELLNGSDVYWAIATGATLEEINQHWDWLEQNLLHTLSVFDNKDDIASFVKGKALIAEETS SRLAEQEEEPEKFREALVKFEARFNFPEAEKLVTYYSCCCWKGRVPRQGWLYLSINHLCF YSFFLGKELKLVVPWVDIQKLERTSNVFLTDTIRITTQNKERDFSMFLNLDEVFKVMEQL ADVTLRRLLDNEVFDLDPDLQEPSQITKRDLEARAQNEFFRAFFRLPRKEKLHAVVDCSL WTPFSRCHTTGRMFASDSYICFASREDGCCKIILPLREVVSIEKMEDTSLLPHPIIVSIR SKVAFQFIELRDRDSLVEALLARLKQVHANHPVHYDTSADDDMVWSLQTLMIPFFMSQAV VSKGEYTWGEPCGDEKNAGPASLVFHSTSMCSDHRFGDLEMMSSQNSEESEKEKSPLMHP DALVTAFQQSGSQSPDSRMSREQIKISLWNDHFVEYGRTVCMFRTEKIRKLVAMGIPESL RGRLWLLFSDAVTDLASHPGYYGNLVEESLGKCCLVTEEIERDLHRSLPEHPAFQNETGI AALRRVLTAYAHRNPKIGYCQSMNILTSVLLLYTKEEEAFWLLVAVCERMLPDYFNHRVI GAQVDQSVFEELIKGHLPELAEHMNDLSALASVSLSWFLTLFLSIMPLESAVNVVDCFFY DGIKAIFQLGLAVLEANAEDLCSSKDDGQALMILSRFLDHIKNEDSPGPPVGSHHAFFSD DQEPYPVTDISDLIRDSYEKFGDQSVEQIEHLRYKHRIRVLQGHEDTTKQNVVGSRRLTP VITLRVVIPEVSILPEDLEELYDLFKREHMMSCYWEQPRPMASRHDPSRPYAEQYRIDAR QFAHLFQLVSPWTCGAHTEILAERTFRLLDDNMDQLIEFKAFVSCLGDAVDYQKQLKQMI KDLAKEKDKTEKELPKMSQREFIQFCKTLYSMFHEDPEENDLYQAIATVTTLLLQIGEVG QRGSSSGSCSQECGEELRASAPSPEDSVFADTGKTPQDSQAFPEAAERDWTVSLEHILAS LLTEQSLVNFFEKPLDMKSKLENAKINQYNLKTFEMSHQSQSELKLSNL >gi568815596f:100902702_101106071|GENSCAN_predicted_CDS_4|3390_bp ncatggagtccagacagcgaccacatgcctcacacgcatgaggagctggccagccccatg ctcacaggtcgcctggtcggcgctctggatgcagtgttggattccaatgcacgggtcgct ccatttcgaattctgcttcaagttcccggctcccaggtttattctcccatagcatgtggt gagttactgaatggatcagacgtttactgggccatagccactggtgcaacattagaggaa atcaatcagcactgggactggctggaacaaaatctcctccacaccttgtctgtctttgat aataaagatgacattgccagttttgtcaaagggaaggctctgatagccgaggagaccagc agcaggctcgccgagcaggaggaggaacccgagaaattccgagaagccctggtgaagttc gaggccaggttcaacttccccgaggcggagaagctggtcacctactactcctgctgctgt tggaagggcagggtgccccgccagggctggctgtacctcagcatcaaccacctctgcttc tactccttcttcctgggcaaggaacttaaacttgtggttccgtgggttgatatccagaaa ttagaaagaacgtccaatgtctttctgacggataccatccgaatcaccacgcagaataag gagcgtgacttctccatgttcctgaacctggatgaggtgtttaaggtcatggagcagctg gccgacgtgacgctgcgaaggctgctggataatgaggtctttgacctcgaccccgatctg caggagccgagccagatcaccaagagggacctggaagccagagcacagaatgagttcttc cgggctttcttcaggttgccgaggaaggagaagctgcacgcggttgtggactgttcgctc tggacgccgttcagtcgctgtcacaccacggggcggatgttcgcctctgacagctacatc tgctttgccagcagagaagatggctgctgtaagatcatcctgccactcagagaggtggtg agcatcgagaagatggaggacacgagcctgctgccgcatcccatcattgtcagtatcaga agcaaggtggccttccagttcattgagctccgggaccgagacagcctggtggaggcgctg cttgcgaggttgaagcaggtccacgccaaccaccccgtgcactacgacacctctgcggat gatgacatggtatggtccctccagaccctgatgatccccttcttcatgtctcaggccgtg gtctccaaaggggagtacacatggggagagccatgtggggatgagaagaatgcggggcct gcttcactcgtgtttcattcaacaagcatgtgcagtgaccacagatttggggatcttgaa atgatgtcttctcaaaatagcgaggagagtgagaaagagaagagcccgctgatgcacccc gatgccctggtcaccgccttccagcagtcaggcagccagagccctgactcccgaatgtcc agagaacagataaaaataagcctgtggaatgaccactttgtggaatacggcagaaccgtg tgtatgtttcgcacagagaagattcggaagctcgtagccatgggcatccctgaatctttg cgagggagactctggcttctcttctcagatgcggtgacggatcttgcctcacaccctggt tactacgggaatctggtggaggagtccctggggaaatgctgcctggtaaccgaggagata gaacgagacctgcaccgctccctgccagagcaccccgccttccagaacgaaacgggaatt gctgctttgaggagagtcttgacggcctatgcccaccggaaccccaagattggatactgc cagtccatgaacatcctgacctccgtgctgctgctgtacaccaaggaggaggaagccttc tggctgttggttgctgtgtgtgagcggatgctgcccgattacttcaaccaccgagtgatc ggggcacaagttgaccagtctgtcttcgaggagctcatcaagggtcatctcccagagctg gcagagcacatgaacgacctctcagccctggcgtccgtctctctctcgtggttcctgacc ctgttcctcagcatcatgcctctagagagtgcggtgaatgtggtagactgcttcttctat gatggcatcaaagccatcttccagctgggactggctgtgcttgaggccaatgctgaggac ctgtgcagcagcaaggatgatggccaggccttgatgatcctcagcaggtttctagatcac attaagaatgaggacagcccagggcccccagttggcagccaccatgcctttttctccgac gaccaggagccctaccctgtgactgatatttcggacctgatccgggattcctatgagaaa tttggagaccagtctgtggagcagatcgagcacctacgttacaagcacaggatcagggtc ctccaaggccacgaggacaccacaaagcagaacgtggtgggtagcagacgcctcaccccg gtgatcacgcttcgagtcgttatcccggaagtctcaattcttcctgaagacctagaggag ctctacgacttattcaagagagaacatatgatgagctgttactgggagcagcccaggccc atggcctcacgccacgaccccagccggccctatgctgagcagtaccgcatagatgcccgg cagtttgcacacctgtttcagctagtctcgccctggacctgcggggcccacacggagatc ctcgccgaaaggacgttcaggctcttggatgacaacatggaccagctcatcgagttcaaa gcgtttgtgagctgcctcggtgatgcagttgattatcagaaacagctgaagcagatgatt aaggatttagccaaagaaaaagataaaactgagaaagaattgcccaaaatgagccagaga gaatttatccagttctgtaaaactctgtacagtatgttccatgaagatccagaagaaaat gatttgtatcaagccatcgccacagtcaccacactgctgctgcagatcggggaggtgggg cagcgaggcagcagctctggaagctgctcccaggagtgtggggaggagctgcgggcttca gctccttctcctgaggactcggtttttgcagacactgggaagacgccccaggactcccag gcatttccagaggcggcagaaagggactggactgtctcccttgaacatattttagcttca cttctgactgaacagtcattagtcaacttttttgaaaagccactggacatgaaatccaaa cttgaaaatgccaagatcaatcagtacaatctcaaaacttttgaaatgagccaccaatca caatctgaacttaagctgagtaacttgtag