GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:58:43 Sequence gi568815593f:90385568_90612136 : 226569 bp : 37.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 492 487 6 1.05 1.07 Term - 1548 1448 101 1 2 29 32 126 0.083 -1.59 1.06 Intr - 13982 13791 192 0 0 34 106 267 0.874 21.64 1.05 Intr - 20232 20118 115 0 1 61 62 122 0.971 6.00 1.04 Intr - 22267 22132 136 0 1 65 90 124 0.950 9.95 1.03 Intr - 24596 24360 237 1 0 72 64 232 0.102 15.01 1.02 Intr - 50653 50538 116 1 2 36 62 93 0.037 -0.17 1.01 Init - 51482 51411 72 2 0 62 98 38 0.265 3.32 1.00 Prom - 57937 57898 40 -3.65 2.03 PlyA - 62703 62698 6 1.05 2.02 Term - 75985 75600 386 2 2 84 35 296 0.778 17.97 2.01 Init - 83042 83024 19 2 1 56 94 11 0.221 -1.26 2.00 Prom - 84096 84057 40 -4.55 3.05 PlyA - 84375 84370 6 1.05 3.04 Term - 88989 88126 864 0 0 -19 48 546 0.241 31.61 3.03 Intr - 89263 89140 124 0 1 69 43 102 0.215 3.47 3.02 Intr - 89476 89389 88 2 1 54 28 106 0.361 -0.69 3.01 Init - 91303 91228 76 1 1 98 78 76 0.580 8.90 3.00 Prom - 93361 93322 40 -9.55 4.00 Prom + 93584 93623 40 -8.35 4.01 Init + 100001 100117 117 1 0 39 95 98 0.932 5.85 4.02 Intr + 102433 102562 130 0 1 51 79 112 0.996 5.95 4.03 Intr + 106490 106603 114 0 0 27 74 102 0.430 2.20 4.04 Intr + 110110 110166 57 2 0 127 93 27 0.656 5.04 4.05 Intr + 116342 116421 80 0 2 37 94 55 0.636 -0.65 4.06 Intr + 120961 121107 147 0 0 31 80 334 0.933 26.41 4.07 Term + 126486 126572 87 2 0 16 48 100 0.201 -4.52 4.08 PlyA + 128234 128239 6 1.05 5.06 PlyA - 128309 128304 6 1.05 5.05 Term - 133917 133252 666 0 0 134 43 442 0.999 37.14 5.04 Intr - 139733 139468 266 0 2 104 127 199 0.584 21.71 5.03 Intr - 144054 143881 174 1 0 42 103 193 0.961 15.19 5.02 Intr - 144464 144334 131 1 2 40 50 95 0.134 0.32 5.01 Init - 149750 149710 41 2 2 54 77 70 0.198 2.21 5.00 Prom - 168245 168206 40 -2.35 6.00 Prom + 172273 172312 40 -2.75 6.01 Init + 173009 173112 104 1 2 64 99 95 0.324 8.04 6.02 Intr + 173229 173350 122 0 2 12 96 55 0.075 -2.08 6.03 Intr + 190016 190353 338 0 2 93 88 132 0.012 8.01 6.04 Term + 190698 190784 87 2 0 73 54 26 0.011 -5.62 6.05 PlyA + 191142 191147 6 -1.75 7.08 PlyA - 191797 191792 6 1.05 7.07 Term - 194320 193498 823 0 1 -12 41 251 0.202 1.97 7.06 Intr - 195239 194950 290 2 2 21 72 145 0.157 1.32 7.05 Intr - 195614 195407 208 1 1 18 72 160 0.332 5.56 7.04 Intr - 200466 200336 131 0 2 94 100 -20 0.051 -1.63 7.03 Intr - 217159 217013 147 1 0 100 66 28 0.197 1.31 7.02 Intr - 218445 218371 75 0 0 85 77 27 0.139 0.09 7.01 Init - 224289 224164 126 0 0 52 91 108 0.840 7.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 24577 24360 218 1 2 67 64 222 0.834 15.92 S.002 Sngl + 58445 59152 708 1 0 86 49 149 0.826 6.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_1|322_aa MAVNDEKMYHTLGTKSYVWVWKLKGRPEVSGTLAQEKFSTSEEWEVKDNYPSQLPSPVMK NPCRRVACSMSLPFPPPKLSIQFLNGGPQTARLYERGTQRPPAKQPHRDRRPDPQRIRGK THRANPKGESGPRLGGRGGVRRSELVVDKTKRKKRRELSEEQKQEIKDAFELFDTDKDEA IDYHELKVAMRALGFDVKKADVLKILKDYDREATGKITFEDFNEVVTDWILERDPHEEIL KAFKLFDDDDSGKISLRNLRRVARELGENMSDEELRAMIEEFDKDGDGESSRDLQLYNRI LDVLGSPGPLQCGKQLQKDSII >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_1|969_bp atggcagtgaatgatgagaaaatgtaccatacattaggcactaaatcttatgtctgggtg tggaagttgaaggggaggccagaagtgtctgggacgcttgcccaggagaaattctcaacc agcgaggaatgggaagttaaggataactacccttcccagcttcctagccctgtgatgaag aatccctgccggagggtggcctgcagcatgtcccttcccttcccaccgccgaagctgtcc atccagtttctgaatggcggcccacagaccgcacggctttacgagagaggaacacagcgc ccacccgccaaacagccacacagggatcggaggccggacccgcaaaggattcggggaaaa acgcatcgagccaatccgaagggcgagagcggccctagattgggggggcgggggggcgtt aggagaagtgagcttgtagtggacaaaacaaagaggaaaaaaagaagagaactgtctgag gaacagaaacaagaaattaaagatgcttttgaactatttgatacagacaaagatgaagca atagattatcatgaattaaaggtggcaatgagagccttggggtttgatgtaaaaaaagct gatgtactgaagattcttaaagattatgacagagaagccacagggaaaatcacctttgaa gattttaatgaagttgtgacagactggatattggaaagagatccccatgaagaaatactc aaggcatttaaactatttgatgatgatgattcaggtaaaataagcttgaggaatttgcga cgtgttgctagagaattgggtgaaaacatgagtgatgaagaacttcgagctatgatagaa gaatttgacaaagatggtgatggagaaagttcaagagatttgcagctctataatcggatc ttggatgttctgggatctccgggacctctacagtgtgggaaacagttacaaaaagatagt atcatctaa >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_2|134_aa MIQDMKGDVINLGDRQLTVMHMPGHSRGSICLHDKDRKILFSGDVVYDGSLIDWLPYSRI SDYVGTCERLIELVDRGLVEKVLPGHFNTFGAERLFRLASNYISKAGICHKVSTFAMRSL ASLALRVTNSRTSP >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_2|405_bp atgatacaagatatgaaaggggatgtgatcaaccttggtgacagacagctcactgttatg cacatgccaggtcactccaggggcagtatttgcttacatgacaaagaccgaaagattctc ttcagtggagacgtcgtgtatgatggatcactgattgactggctcccatacagcaggata agtgactatgttggaacttgtgaacgtctaatagaattagtggacagaggtctggtagag aaggtgcttcctgggcacttcaatacctttggtgctgaaaggctttttcgattggcttct aactatatttcaaaagctgggatatgtcacaaagtttctacttttgccatgcgatctctt gcaagtttagctctacgtgtaacaaattctaggacctcgccctag >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_3|383_aa MDAIVERVAEEVLSYLVIPEHKPDKIFDCFLGYLLPKSRSVGKTLQPKCRPARKGKAPPP RQRGFFALGRRRSRRTTREAPHFLRQLGVRALQSQWLLRPRAGGAASACCRPGGRVRERT LRAASVCVLPPPIPTPASHAPASLYASSLAAGRPLHTAAVPCVWTACGRLARPHSPRLAA PLPSMSALEWYAHKSLGDGIFWIQERFYESGNRANIWLVRGSEQDVVIDTGLGLRSLPEY LYSSGLLQDREAKEDAARRPLLAVATHVHFDHSGGLYQFDRVAVHHAEAEALARGDNFET VTWLSDSEVVRTPSPGWRARQFRVQAVQPTLILQDGNGPPRARALVKGGYWEETVRVPHV ADLRGHKGDYFGTSQSHFCFTLP >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_3|1152_bp atggatgctattgtggaaagggtggctgaagaagtcctctcttacttggtgatacctgag cataagcctgacaaaatcttcgattgcttcctcggttacctgctccctaagtctcgaagc gtgggcaaaactctgcagccgaagtgccggccggcaaggaaagggaaggccccaccccct cggcaaagaggtttttttgcccttggcaggcgccgtagtcgccgtacaacccgcgaggcc ccacactttctgcgccagctgggagtgcgcgccctgcagagccagtggctcctgcgcccc cgtgctggtggcgcagcctctgcttgctgccgacctggcggaagggtgcgggagcgcacg ctacgggcagcgtcggtctgcgtccttcctcctcctattccgacccccgcatcccacgcc cccgcctcgctctacgcctcctccctggctgcaggcagaccgcttcacacagccgcagtg ccctgtgtgtggacagcctgtgggagactcgcacgcccacactcaccccgcctggctgca cccctgcccagcatgtcggcgctcgagtggtacgcccacaagtctctaggcgatggtatc ttctggattcaagaacgtttctacgagtcgggcaaccgtgccaacatctggctggtgcgc ggctccgagcaggacgtggtgatcgatacaggcctggggctgcgcagcctcccggagtac ctgtactcctccggcctcttgcaggaccgagaggccaaagaggacgcggcgcgccggcca ctgcttgccgtggccacccacgtgcacttcgaccactccggcggcctctaccagttcgac cgcgtggcagtgcaccacgccgaggccgaggcgctggctcgcggggacaactttgagacc gtgacctggctttccgatagcgaggtggtgcggacgcccagccccggctggagggccaga cagttccgggtacaggcggtgcagcccaccctcatcctgcaggatggtaatgggcccccg cgggcgcgcgctctcgttaagggagggtattgggaagagactgtccgagtgccgcatgtt gcagacttgcgtggccataaaggggattattttggcacctcgcagagccacttctgtttc acccttccctga >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_4|243_aa MAGNKGRGRAAYTFNIEAVGFSKGEKLPDVVLKPPPLFPDTDYKPVPLKTGEGEEYMLAL KQELRETMKRMPYFIETPEERQEKHVPQFLDESEVKCKEIDSSLDAHDDDEIDCISKISD YIERYSKRYMKVYKEEWIPGPKPKKAKDAGKGTPLTNTEDVLKKMEELEKRGDGEKSDEE NEEKEGSKEKSKEGDDDDDDDAAEQEEYDEEEQEEENDYINSYFEDGDDFGADSDDNMDE ATY >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_4|732_bp atggctgggaataaaggaagaggacgtgctgcttatacctttaatattgaggctgttgga tttagcaaaggtgaaaagttacctgatgtagtgttgaaaccacccccactatttcctgat acagattataaaccagtgccactgaaaacaggagaaggtgaagaatatatgctggctttg aaacaggagttgagagaaacaatgaaaagaatgccttattttattgaaacacctgaagaa agacaagaaaaacatgttccacaattcttagacgaatcagaagtcaaatgcaaagagata gatagctctctggatgctcatgatgatgatgaaattgattgtataagcaaaatctcagac tatattgaaaggtatagtaaaagatacatgaaggtatacaaggaagaatggataccaggc ccaaaacccaaaaaggcaaaagacgcaggcaaaggcacaccactcactaatactgaagat gtgttgaaaaaaatggaggaattggaaaaaagaggtgatggtgaaaaatcagatgaggaa aatgaagagaaagaaggaagcaaagagaaaagtaaagaaggtgatgatgacgatgacgat gatgccgcagaacaggaggaatatgatgaagaagagcaagaagaggaaaatgactacatt aattcatactttgaagatggagatgattttggcgcagacagtgatgacaacatggatgag gcaacctattag >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_5|425_aa MADIEKKVNKAHGRKFEMHSKKKMQDSSPTHPEPRLVLRVHPTVSASAVWTLDKDAASLR TWRRRTARKAETLAAEAEADGVSPKQRLGHELTRDSGGGGPFGHLLAECSLLTGTDFNIM AGRHQNRSFPLPGVQSSGQVHAFGNCSDSDILEEDAEVYELRSRGKEKVRRSTSRDRLDD IIVLTKDIQEGDTLNAIALQYCCTVADIKRVNNLISDQDFFALRSIKIPVKKFSSLTETL CPPKGRQTSRHSSVQYSSEQQEILPANDSLAYSDSAGSFLKEVDRDIEQIVKCTDNKREN LNEVVSALTAQQMRFEPDNKNTQRKDPYYGADWGIGWWTAVVIMLIVGIITPVFYLLYYE ILAKVDVSHHSTVDSSHLHSKITPPSQQREMENGIVPTKGIHFSQQDDHKLYSQDSQSPA AQQET >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_5|1278_bp atggcagatattgagaaaaaggtgaacaaggctcatgggagaaaattcgagatgcacagc aagaaaaaaatgcaagacagctccccgacacatcctgagccacgcctcgtcctcagggtg cacccgacagtctctgccagcgcagtgtggacgctggacaaagacgcagccagtttgcgg acgtggcggcggcgtacggcccggaaggcggagacgttggcggcagaggcggaggcggac ggggtcagcccaaagcagaggctcggccatgaacttacccgggacagcggcggcggcgga ccttttggccatcttctcgcagagtgctccctgctaacggggacagattttaacattatg gcagggaggcatcagaatcgtagttttcctcttccaggagttcagtcaagtggtcaagta catgcatttggaaattgttcagacagtgatattttggaggaggatgctgaagtgtatgaa cttcgatccagaggaaaagagaaagtccgaagaagtacatcaagagatagacttgacgac attatagtattaacaaaagatatacaagaaggagatacattaaatgcaatagcccttcag tactgttgtacggtagcagatatcaagagagttaacaatctcatcagtgatcaagacttt tttgcccttaggtctatcaaaattccagtaaaaaagttcagttccttgaccgaaacactt tgtcctccaaaaggaagacagacttcacgtcattcatctgttcaatactcttccgaacaa caggaaattttgccagctaatgattctcttgcttacagtgactcagctggtagcttttta aaagaagtagaccgagacatagaacaaatagtaaagtgtacagacaataagagagagaac ctcaatgaggtagtatcggccttaacagcacaacaaatgcgttttgaacctgataacaaa aacactcaacgtaaagacccctattatggagcagactggggaatagggtggtggacagct gtagtgataatgttgatagtaggtataataacaccagtgttttatttgttgtattatgaa attttagctaaggtggatgttagtcatcattcaacagtggactcttcacatttacattca aaaatcacacccccatcacagcagagagaaatggaaaatggaattgtgccaactaaagga atacatttcagccaacaagatgatcataaactgtatagtcaagattctcagtcacctgct gctcaacaggaaacatag >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_6|216_aa MAKAGCSRRQSADLAARCGEQSAQPGAATPAPLVSGSKNQQRGQGVRTGVRGRARVCGGP AGTAGSARMSVFLGPGVSPPMNGQRKLHPRPILSPRWLWGPALPQGFVYRPGNFDHERQT PSNWRLTSGNCSDYSQRAYMGAGHPGEFSGMIQICLLGTEHHSPGSQCCLDSRKPVSAAP GVPLGVAEPPAPRQAPVCDVPRCVSRCSHCSVPTYE >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_6|651_bp atggccaaggcagggtgttcccgacgccagagcgcggacctggctgcccgctgcggagag cagagtgcgcagcccggggcggccacccccgctccgctggtcagtggtagtaagaatcag cagcgcgggcaaggagtacggacgggagtcagaggcagagcgagggtgtgtggagggccg gcggggaccgccgggagcgcgcggatgtcggtgttcctggggccaggggtttctcctccc atgaatggacagcgaaagcttcatcctagaccaatcctgtctccacgatggctttgggga cctgccttgccccagggttttgtctacaggcctggcaactttgaccatgagaggcagact ccctctaactggaggctgacaagtggaaactgttcggattattcccagagggcttacatg ggggcgggtcatcctggtgaattctcaggcatgatccaaatctgcctcctaggcacagaa caccattctccagggtcgcagtgctgcctggactccaggaagccagtctctgctgctcct ggtgtgcccctaggggtggccgagccccccgccccacgacaagccccagtgtgtgatgtt ccccgctgtgtgtccaggtgttctcattgttcagttcccacctatgagtga >gi568815593f:90385568_90612136|GENSCAN_predicted_peptide_7|599_aa MGQGPEKTNITSAELNERIDLISFIFYLVNAIASIFPWCDHQTRVKEESKKFLISMSFRG GKPVHKVLIFFFCSRITSRTPHYIGLSCFLGLLLAVTFPQSFLIFDDFGAFEECGQGLLQ AGGLQLEKELLSIFSFCLSILFFLTRLANWFPIPTKDRARKLKKLEKRLDEWLTRINSVE ETLNDLMELKNMAQELRDACTSFNSQFDQGEERITVIEDQINEIKWEDKETQRTPQRYSS RRATPTHITVRFTKDEMKEKMLRAAREKGWVTHKGKSIGLTADLSAETLQARREWEPIFN ILKEMNFQPRISYPAKLSEGESGKTRKIKRDKEGHYIMVKGSIQQEELTILNISAPNTGA PGFIKQVLRDLQRDLDSHTIIMGDFNTPLSTLDRSTRQKVNKDIQDMNSALHQVDLIDIY RTLHPKSTEYTFFSAPHHTYSKIDHIIGTKALLSKCKRTEITTNCLSDHSAIKLELGIKK LTQNHTTTWKLNNLLLNDYWVNNKMKAEIKMFFETNENKDTTYQNLWDIRKAVCRGKFIA LNAHKRKQGRSKIGTVTSQLKELEKQEQTHSKASRRQEITKIRAELKETETQNTLQKNQ >gi568815593f:90385568_90612136|GENSCAN_predicted_CDS_7|1800_bp atgggacaagggcctgaaaagacaaacattactagtgcagaattgaatgaacgcattgac cttattagcttcatcttttaccttgtcaatgccatagcaagcatcttcccatggtgtgac caccagactagggtcaaagaggaaagtaagaaatttcttatctccatgtcattcagaggc ggaaaaccagttcacaaggtgctaattttctttttctgttccaggatcacatccaggaca ccacattacattgggttgtcatgtttccttgggctccttttggctgtgacatttcctcag agttttcttatttttgatgactttggcgcttttgaggaatgtggtcagggactgctgcaa gctggtggcttacagctggagaaagagttattgtccatcttttctttttgcctgtcgata cttttcttcctcactcggctggccaactggtttcccatccccactaaggacagagcaagg aagctaaaaaaacttgaaaaaaggttagacgaatggctaactagaataaacagtgtagag gagactttaaatgacctgatggagctgaaaaacatggcacaagaacttcgtgacgcatgc acaagcttcaatagccaattcgatcaaggggaagaaaggataacagtgattgaagatcaa attaatgaaataaagtgggaagacaaggaaacacagagaacaccacaaagatactcctcc agaagagcaaccccaacacacataactgtcagattcaccaaggatgaaatgaaggaaaaa atgttaagggcagccagagagaaaggttgggttacccacaaagggaagtccatcggacta acagcggatctctcggcagaaaccctacaagccagaagagagtgggagccaatattcaac attcttaaagaaatgaattttcaacccagaatctcatatccagccaaactaagtgaagga gaaagcgggaagacaagaaagatcaaaagagacaaagaaggccattacataatggtaaag ggatcaattcaacaagaagaactaactatactaaatatatctgcacccaatacaggagca cccggattcataaagcaagtccttagagacctacaaagagacttagactcccacacaata ataatgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaggtt aacaaggatatccaggacatgaactcagctctgcaccaagtggacctaatagacatctac agaactctccaccccaaatcaacagaatatacattcttctcagcaccacatcacacttat tctaaaattgaccacataattggaaccaaagcactcctcagcaaatgtaaaagaacagaa atcacaacaaactgtctctcagaccacagtgcaatcaaattagaacttgggattaagaaa ctcactcaaaaccacacaactacatggaaactgaacaacctgctcctgaatgactactgg gtaaataacaaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagac acaacataccagaatctctgggacatacgtaaagcagtgtgtagagggaaatttatagca ctaaatgcccacaagagaaagcagggaagatctaaaatcggcaccgtaacatcacaatta aaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataact aagatcagagcagaactgaaggagacagagacacaaaacacccttcaaaaaaatcaatga